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FOLDING PROTEINS 

The present invention relates to the folding of 
proteins. In particular, the present invention provides 
proteins, which are able to form a complex in vitro, 
5 useful in facilitating folding of proteins', for example 
those produced using recombinant DNA technology. Genes 
encoding the proteins of the complex are also provided. 
The present invention further relates to methods of 
assembling a protein complex able to fold proteins in an 

10 in vitro environment. 

Molecular chaperones are knovm to be able to assist 
in the folding of proteins along the folding pathways 
from denatured state to correctly folded product 
(reviewed in [1] ) . One well studied class of molecular 

15 chaperones are the chaperonins (GroEL, Hsp60 and Rubisco 
subunit binding protein) found in eubacteria, 
mitochondria and plastids (reviewed in [2,3]) . They are 
14-mer double -torus structures composed of one (GroEL and 
Hsp60) or two (Rubisco subunit binding protein) subunit 

20 types and show seven-fold symmetry [4,5] , In vitro these 
chaperonins bind denatured proteins and upon ATP 
hydrolysis, release them into aqueous solution where they 
complete folding [6] . In vivo there is evidence that 
they are involved in the folding, transport and assembly 

25 of newly synthesized proteins. The original mutations 
isolated in groE affected the folding of bacteriophage 
particle subunits [7] but more recent genetic analysis 
suggests a more general role in protein biogenesis in E. 
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coli [8] . Hsp60 is involved in the import of proteins 
into the mitochondrial lumen from the cytoplasm [9] . 

Although no GroEL-like chaperonins have been 
identified in eukaryotic cytosol, the double- torus TCP-i- 
5 containing particle seems to be a component of the 

eukaryotic folding machinery and may play an analogous 
role to that of GroEL in eubacteria and the GroEL related 
chaperonins in symbiotic organelles. TCP-l is weakly 
related to the GroEL family [10] but shows nearly 40% 
10 identity to an archaebacterial chaperonin, TF55 [ii] . it 
has been proposed that GroEL and TCP-i are subfamilies 
derived from a primordial gene [10,12,13,14] and.it has 
been suggested that the eukaryotic TCP-l -containing 
chaperonin may have evolved "from an archaebacterial 
15 lineage [3,11] . 

Recently, purified chaperonin containing TCP-l has 
been shown to facilitate the folding of actin [15] and 
tubulin [16,17] in vitro and it binds newly- synthesized 
actin, tubulin and some other unidentified polypeptides 

20 in vivo [18] . One striking difference between the 

bacterially derived chaperonins and the TCP-l -containing 
chaperonin is the heteromeric nature of the TCP-1- 
containing particle [14,15,17,19]. There are at least 
five polypeptide species in the complex containing TCP-l 

25 [14,17]. 

To date, little sequence information has been 
available on the polypeptides which make up the complex, 
despite the fact that various parties have obtained the 
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sequences of peptides from a number of polypeptides of 
TCP complexes of different organisms. Frydman et al (17) 
demonstrated the presence of six subunits in bovine TCP 
complex, which they termed "TRiC" (TCP-1 ring complex) , 
5 and obtained some peptide sequence information indicating 
some resemblance both between polypeptides of the complex 
and between these polypeptides and those of other 
organisms. Rommelaere et al [59] looked at the cytosolic 
chaperonin from both rabbit reticulocyte lysate and 

10 bovine testis. They report finding eight different 
polypeptides in rabbit reticulocyte chaperonin, and 
obtaining partial amino acid sequences of all eight. 

However, full length clones have proved elusive. 
The full sequence of murine TCP-1 has been available 

15 since 1986 (20) and Ehmann et al (FEES, 336: 2, 313-316, 
1993) have reported the obtention of a TCP-1 related 
sequence from Avena sativa. (oat) seedlings. Despite this 
information being available, there has yet to be a report 
of the obtention of full-length nucleic acid sequences 

20 encoding the components of a mammalian TCP-1 complex. 
Knowledge of short peptide sequences derived from 
individual subunits of chaperonin containing TCP-1 has 
not enabled the specific cloning of the full-length cDNA 
for individual subunits. One problem is that in order to 

25 be sure a peptide sequence is derived from a subunit of 
the family, it must be identifiably homologous to the 
only full-length mammalian sequence available, ie TCP-1 
(20,48) . Any DNA sequences derived by reverse 
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translation from the novel peptide sequence will also be 
related to TCP-1 and the related gene sequences. If 
these sequences are used as PGR primers they will prime 
synthesis and amplification of many TCP-l related 
5 sequences, so further insight and activity are needed to 
identify the sequences which encode particular subunits 
of the complex. 

The present invention provides individually seven 
nucleic acid molecules with sequences encoding subunits 
10 of the TCP-l-containing chaperonin, different from the 

original Tcp-1 gene (reported in 20) . Since, in mice, at 
least three of the novel Tcp-l related genes are unlinked 
to the mouse t complex, it is proposed to rename the TCP- 
1 complex [14] as CCT, chaperonin containing TCP-1. Only 
15 now we have all the eight complete sequences of the 

ubiquitously expressed subunits is it possible to know 
the gene and subunit to which each PCR product 
corresponds. Likewise, all other TCP-1 related genes in 
the databases make no sense without the complete 
20 sequences being available. 

The present invention also provides molecules which 
are mutants, derivatives or alleles of any one of the 
seven sequences provided, particularly mutants, 
derivatives and alleles which encode a protein which 
25 retains a functional characteristic of the protein 

encoded by the respective wild- type gene, especially the 
ability to associate with at least another subunit to 
form a complex able to fold a polypeptide. Changes to a 
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sequence, to produce a mutant or derivative, may be by 
one or more of insertion, deletion or substitution of one 
or more nucleotides in the nucleic acid, leading to the 
insertion, deletion or substitution of one or more amino 
5 acids. Of course, changes to the nucleic acid which make 
no difference to the encoded amino acid sequence are 
included. We have demonstrated the existence of the 8 
gene sequences in yeast and in plants by hybridization 
(52, unpublished results) and 6 genes have been isolated 
10 from yeast by ourselves and others (53) . These 6 yeast 
genes correspond exactly (ie. they are the orthologues 
of) to six of the genes exemplified in this application 
CCTor, 13, 7, 6, e, 77 and f. We predict that all 
eukaryotic organisms contain at least the set of eight 
15 genes which we have described in mouse. There may be 

tissue specific CCT genes or additional CCT genes in some 
organisms but we would expect each of these to be closely 
related (greater than 70% amino acid sequence homology) 
to one of the eight genes described here. These eight 
20 CCT genes of mouse constitute the basic family which 

comprise the core CCT complex. In a preferred embodiment 
of the present invention, the sequence is one encoding a 
polypeptide found in a human or a mouse. 

The polypeptides may have an amino acid sequence 
^5 which shares a significant degree of homology with any of 
the specific sequences provided herein. Such homology 
may for example, be 60% or greater, 70% or greater, 80% 
or greater, 90% or greater or 95% or greater, provided 
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the polypeptide is able to function as a subunit of a 
complex able to fold a polypeptide. 

The sequences of polypeptides encoded by nucleic 
acid according to each of seven different embodiments of 
5 the present invention are provided in Figure 3, (a) to 

(f) and Figure 8 (h) Preferred nucleic acid sequences are 
shown in Figure 8 (b) to (h) . The present invention also 
provides a vector which comprises nucleic acid with any 
one of the provided sequences, preferably a vector from 
10 which polypeptide encoded by the nucleic acid sequence 
can be expressed. The invention further encompasses a 
host cell transfected with such a vector. The host cell 
may be bacterial or eukaryotic, such as yeast or 
mammalian eg murine. 
15 According to another aspect of the present invention 

there is provided a method of making a polypeptide 
component (subunit) of a complex, which complex is- able 
to fold a polypeptide, which method comprises expressing 
the polypeptide component from encoding nucleic acid, the 
20 component being other than TCP-1, (A nucleic acid 
sequence encoding TCP-1 is shown in Figure 8(a) .) 
Preferably the nucleic acid is one with a sequence shown 
in Figure 8 or a mutant, derivative or allele of such a 
sequence. Appropriate cof actors or accessory materials 
25 may be needed in order for the complex to function in 

some circumstances. Indeed, specificity of the complex 
for folding different polypeptides may be altered 
according to which cofactors are present and/or what 
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combination of polypeptide components ( "subunits" ) is 
used. 

The present invention also provides a polypeptide 
component of a complex able to fold a polypeptide, the 
5 polypeptide being substantially free of other 

polypeptides or substantially free of other polypeptides 
of the complex, and being other than TCP-1. Preferably, 
the polypeptide component, or a mutant or derivative 
thereof can be found in a mammal, most preferably in a 
10 mouse or a human. In a preferred embodiment of the 

present invention the polypeptide is one which has any 
one of the amino acid sequences shown in figure 8 (b) to 
(h) (see also Figure 3) . Modified polypeptides (eg 
mutants or derivatives) which have the ability to form a 
15 complex able to fold a polypeptide are encompassed by the 
present invention . 

The provision for the very first time of full-length 
sequences of nucleic acid molecules which encode 
polypeptides of a complex able to fold a polypeptide 
20 enables the production of such a complex using 

recombinant techniques . According to another aspect of 
the present invention there is provided a method of 
producing a complex able (under appropriate conditions) 
to fold a polypeptide, the method comprising expressing 
25 polypeptide components of the complex from encoding 
nucleic acid and causing or allowing assembly of the 
polypeptide components into the complex. Preferably at 
least some of the components are murine or human. 
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Preferably a polypeptide component of the complex is any 
one of those shovm in figure 8 (a) to (h) , or a mutant or 
derivative thereof. Polypeptide components of the 
complex may be expressed individually and then purified, 
5 with the assembly carried out in vitro by mixing CCT 

subunits in the appropriate combinations. On the other 
hand, subunits may be expressed together. 

Also provided by the present invention is a complex 
able to fold a polypeptide, the complex having been 
10 produced using recombinant DNA technology. The use of 

recombinant DNA technology in the production of a complex 
enables an intelligent selection to be made about which 
subunits to include, in those cases where not all of 
those provided are not required, and the easy assembly 
15 and purification of complexes with the required 

biological activity. This is facilitated still further 
by the provision of subunit specific antibodies, 
discussed below. The functional test for a useful 
complex is its ability to fold a polypeptide. Results 
20 shown below indicate that different combinations of 

subunits exist in vivo. Immunoprecipitation experiments 
may be used to determine subunit combinations in purified 
complexes. Furthermore, recombinant production of 
different combinations of subunits and in vitro testing 
25 of ability to bind substrate polypeptide (eg in a 

reticulolysate system) or ability to fold polypeptide (eg 
in an E. coli expression system or in a separate 
combination stage following earlier production of the 
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subunits) enables easy determination of subunit 
combinations which are functional. 

The present invention also encompasses the use of a 
complex in the folding of a polypeptide. Generally the 
5 complex will be one produced using recombinant DNA 

technology. As discussed, the complex may consist of all 
or less than all of the subunits provided herein. 
Similarly, the invention provides a method of folding a 
polypeptide which comprises causing or allowing such a 
10 complex to fold the polypeptide, following a previous 
step of production of the complex. In fact, it may be 
that the complex is produced at the same time as the 
polypeptide to be folded, eg by expression from the same 
vector and/or expression in the same host cell, rather 
15 than in a previous step. 

The provision by the present invention of the full- 
length -sequences of the subunits of a folding complex 
enables the production of subunits individually, with 
subsequent purification and combining to form a complex, 
20 the production of subunits together with combination of 
subunits taking place in the host cell culture (eg E. 
coll, yeast or Baculovirus systems) and with subsequent 
purification, or the expression of combinations of 
subunits with substrate in a host cell system (eg E, 
25 coli, yeast or Baculovirus systems) , thus constructing a 
folding factory in vivo. 

Until now, it has proved problematic to obtain 
antibodies which are specific for an individual subunit 
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from the complex, in the sense of distinguishing between 
one subunit and the others. The reasons for this are 
apparent: the subunit s share (varying degrees of) 
homology with one another. Accordingly, attempts to 
5 obtain subunit specific antibodies have failed. 

Immunisation of an animal with a purified subunit causes 
the production of antibodies which, in the most part, are 
cross -reactive with a number of subunits. Now that the 
present application has provided the full sequences of 
10 the subunits for the first time, it has proven . possible 
to identify regions of the stibunits which are 
sufficiently different from corresponding regions in the 
other subunits to enable the production of subunit 
specific antibodies. 
^5 Thus, according to another aspect of the present 

invention there is provided an antibody specific for a- 
CCT subunit (subunit of a complex able to fold a protein) 
other than TCP-1 (CCTof) , preferably a mammalian subunit, 
such as human or murine. In a preferred embodiment, the 
20 antibody is specific for an epitope of a peptide 

corresponding to a C-terminal portion of a subunit, or a 
variant of such a peptide (modified by any of insertion, 
substitution or deletion of one or more amino acids) 
Most preferred peptides for this purpose are as follows: 
25 APRKRVPDHHPC (CCTbeta) SEQ . ID NO. 1 



NRQTGAPDAGQE 



(CCTgamma) 



SEQ. ID NO. 2 



SILKIDDWNTR 



(CCTdelta) 



SEQ. ID NO, 3 



IDDIRKPGESEE 



(CCTepsilon) 



SEQ. ID NO. 4 
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SAGRGRGQARFH (CCTeta) SEQ. ID NO. 5 

SGKKDWDDDQND (CCTtheta) SEQ. ID NO. 6 

EIMRAGMSSLKG (CCTzeta) SEQ, ID NO. 7 

These peptides (including any variant modified in any 
5 way, provided the subunit specific nature of an antibody 
raised to the variant is retained) each represent an 
aspect of the present invention. 

According to a further aspect of the invention there 
is provided a method of obtaining an antibody specific 
10 for a CCT subunit, other than TCP-l (CCTor) , which method 
comprises immunising a mammal (eg mouse, rat, rabbit, 
horse, goat, sheep or monkey) with a peptide of the 
subunit. Preferably, the peptide is one of those listed 
above, or a peptide which is a variant of one of those 
15 listed, provided subunit specificity of an antibody which 
binds the peptide variant is retained. Antibodies may be 
obtained from immunised animals using a variety of 
techniques known in the art, and screened, preferably 
using binding of antibody to antigen of interest (one of 
20 the C- terminal peptides, for example) . For instance. 

Western blotting techniques or immunoprecipitation may be 
used (Armitage et al (1992) Nature 357: 80-82). 

As an alternative or supplement to immunising a 
mammal with a peptide., an antibody specific for a subunit 
25 may be obtained from a recombinantly produced library of 
expressed immunoglobulin variable domains, eg using 
lambda bacteriophage or filamentous bacteriophage which 
display functional immunoglobulin binding domains on 
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their surfaces; for instance see WO92/0104 7. The library 
may be "naive", that is constructed from sequences 
obtained from an. organism which has not been immunised 
with any of the peptides, or may be one constructed using 
5 sequences obtained from an organism which has been 
exposed to the peptide or peptides of interest. 

Antibodies according to the present invention may be 
modified in a number of ways. Indeed the term "antibody" 
should be construed as covering any binding member having 
10 a binding domain with the required specificity. Thus the 
invention covers antibody fragments, derivatives, 
functional equivalents and homologues of antibodies. 

Example antibody fragments, capable of binding an 
antigen or other binding partner are the Fab fragment 
15 consisting of the VL, VH, CI and CHI domains; the Fd 
fragment consisting of the VH and CHI domains; the Fv 
fragment consisting of the VL and VH domains of a single 
arm of an antibody; the dAb fragment which consists of a 
VH domain; isolated CDR regions and F(ab')2 fragments, a 
2 0 bivalent fragment, comprising two Fab fragments linked by 
a disulphide bridge at the hinge region. Single chain Fv 
fragments are also included. 

A derivative is a substance derived from a 
polypeptide or antibody. The derivative may differ from 
25 a polypeptide from which it is derived by the addition, 
deletion, substitution or insertion of one or more amino 
acids, or the linkage or fusion of other molecules to the 
polypeptide. Changes such as addition, deletion. 
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substitution or insertion may be made at the nucleotide 
or protein level . 

A hybridoma producing a monoclonal antibody 
according to the present invention may be subject to 
5 genetic mutation or other changes. It will further be 
understood by those skilled in the art that a monoclonal 
antibody can- be subjected to the techniques of 
recombinant DNA technology to produce other antibodies or 
chimeric molecules which retain the specificity of the 

0 original antibody. Such techniques may involve 
introducing DNA encoding the immunoglobulin variable 
region, or the complementarity determining regions 
(CDRs) , of an antibody to the constant regions, or 
constant regions plus framework regions, of a different 

5 immunoglobulin. See, for instance, EP184187A, GB 

2188638A or EP-A-0239400 . Cloning and expression of 
chimeric antibodies are described in EP-A- 0120694 and EP- 
A-0125023 . 

The peptides for use in obtaining antibodies may be 
) made by any of a variety of techniques known to those 
skilled in the art. For instance, this may be by solid 
phase synthesis (Merrifield, JACS 85: 2159-2154 (1963)) 
or by techniques described in Bodanszky et al, Peptide 
Synthesis, second edition (Wiley, 1976) . Standard 

1 solution peptide synthesis methodologies, using chemical 
or enzymatic methods of amide bond formation may be 
employed. Commercial peptide synthesizing machines are 
available , 
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Conveniently, an amino terminal cysteine can be 
added to the chosen peptide in order that the peptide can 
be coupled to PPD through the cysteine using standard 
chemistry. 

5 Systems for cloning and expression of a polypeptide 

in a variety of different host cells are well known. 
Suitable host cells include bacteria, mammalian cells, 
yeast and baculovirus systems. Mammalian cell lines 
available in the art for expression of a heterologous 

10 polypeptide include Chinese hamster ovary cells, HeLa 
cells, baby hamster kidney cells and many others. A 
common, preferred bacterial host is £. coli. 

Suitable vectors can be chosen or constructed, 
containing appropriate regulatory sequences, including 

15 promoter sequences, terminator fragments, polyadenylation 
sequences, enhancer sequences, marker genes and other 
sequences as appropriate. For further details see, for 
example, Molecula^r Cloning: a Lahoratory Manual: 2nd 
edition, Sambrook et al, 1989, Cold Spring Harbor 

20 Laboratory Press. Transformation procedures depend on 
the host used, but are well known. 

Baculovirus expression systems are commercially 
available in kit form, eg MaxBac™. The techniques 
employed are described in Summers and Smith, Texas 

25 Agricultural Experiment Station Bulletin 1555 (1987) . 
See also The Molecular Biology of Baculovlruses (ed 
Doerfler, 1986) for a general overview of Baculovirus. 
Baculovirus expression vectors have been developed for 



wo 95/20654 PCT/GB95/00192 

15 

infection into a number of cell -types, including those 
from Aedes aegyptx, Autograph calif arnica, Bornhyx mori, 
Drosophila melanogaster, Spodoptera frugiperda and 
trichoplusia ni. 
5 Embodiments of the present invention are now 

described further by way of illustration, with reference 
to the figures identified below. All documents mentioned 
in the text are hereby incorporated by reference. 
Figure 1 shows one- and Two-dimensional gel 

10 electrophoresis of mouse CCT and bovine TRiC. (a) SDS- 
PAGE analysis (8% gel) of mouse testis CCT. TCP-1- 
containing particles purified by sucrose gradient 
fractionation followed by ATP-affinity column 
chromatography from mouse testis [14] were 

15 electrophoresed. (b-c) 2D-PAGE analysis of mouse CCT 
(same sample as Figure la) and bovine TRiC [17] (kind 
gift of F.-U. Hartl) . Subunits of TCP- 1- containing 
particles purified by sucrose gradient fractionation 
followed by ATP-affinity column chromatography from mouse 

20 testis (b) and bovine testis (c) were separated by lEF 
followed by SDS-PAGE, (d) 2D-NEPHGE analysis of bovine 
TRiC, Subunits of bovine TRiC were separated by NEPHGE 
followed by SDS-PAGE. (e) 2D-NEPHGE analysis of 
antibody-affinity purified CCT. TCP- 1 -containing 

25 particles were purified from ^^S-labelled F9 cell extracts 
using monospecific monoclonal antibodies to TCP-l. 
Proteins were visualized by silver staining (a-c) , 
Coomassie staining (d) or autoradiography (e) . Arrowhead 
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indicates a 45 kDa protein of pi 6.5 which co-purified 
with F9 cell CCT. Arrows show Hsp70 proteins, 
(f) Molecular weights and pi values of the polypeptides 
comprising mouse CCT and bovine TRiC . These values are 
5 determined from 2D-gel analysis with both pi and 
molecular weight markers. 

Figure 2 shows an alignment of peptide sequences 
deduced from 5' -end DNA sequences of 16 Tcp-l related 
genes (SEQ. ID NO'S 8-23). Clone names are used except 

10 for Tcp~l genes of mouse, human and yeast in this figure. 
Letters in parentheses indicate the source species: M, 
Afus musculus (mouse) ; H, Ho;no sapiens (human) ; C. 
Ccienorhabditis elegans (nematode) ; S, Saccharomyces 
cerevisiae (yeast) . Sources of the nucleotide sequences 

15 are as follows: mouse Tcp-I [20,48]; human Tcp-1 [48]; 
yeast Tcp-1 [27]; 4950 [28]; pGl-pG4 (partial 5' -end 
sequences) [22] and this report; p3 83 and p3 84 (partial 
5' -end sequences), pTS2, pTy?, pT62, pTeS, pTfl2 and 
pCBIiSO, this report. Although three additional human 

20 Tcp'l related partial DNA sequences, pAP3 (A. Malik et 

al., manuscript in preparation), HTR3 [41] and IB713 [49] 
are known, these sequences are not added to this figure 
due to absence of 5' -end sequences (pAP3 and HTR3) or 
inaccuracy of sequence (IB713) . However, judging by DNA 

25 sequence homology, IB713 , pAP3 and pHTR3 are members of 
groups 2, 5 and 6, respectively. 

Figure 3 shows amino-acid sequences of S, 6, 
and 77 subunits of mouse CCT. Amino-acid sequences of 
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mouse CCTS (a) (SEQ. ID NO. 25) , CCTy (b) (SEQ. ID NO. 
26), CCT5 (c) (SEQ. ID NO. 27), CCTe (d) (SEQ. ID NO. 
28), CCTT (e) (SEQ. ID NO. 29) and CCT77 (f) (SEQ. ID NO, 
30) deduced from nucleotide sequences of cDNA clones 
5 pTS2, pTt7, pT62, pTe5, pTi-12 and pCBL80 respectively. 

Sequences of tryptic peptides derived from B1-B4 of mouse 
CCT (Figure la) are shown by bold underlining. Thin 
underlines indicate amino acids identical to those of 
bovine P1-P3 and P5 ( [17] and Figure 1) determined from 
10 tryptic polypeptides of the TRiC subunits ( [17] , and 

unpublished results of F.U, Hartl et al . ) . An amino-acid 
position not determined by peptide sequencing is 
indicated by an 'x'. A bovine tryptic peptide sequence, 
P5/T36, contains 3 amino-acid differences compared to the 
15 mouse sequence and these are shown underneath. 

Figure 4 shows alignment of amino acid sequences of 
eight mouse CCT subunits and the homo-oligomeric V- 
archaebacterial chaperone TF55. A dash indicates an 
amino-acid gap. Conserved amino acids are indicated by 
20 bold characters. Consensus amino acids of these proteins 
are shown under the alignment (SEQ. ID NO. 33) . The 
amino acid sequence of CCT5 is SEQ. ID NO. 31) . The 
amino-acid sequences of mouse CCTa (TCP-1) [20,48] (SEQ. 
ID NO. 24) and archaebacterial chaperone TP55 of 
25 Sulf Globus shihsLtae [11] (SEQ. ID NO. 32) are derived 
from previous publications. 

Figure 5 shows comparison between a consensus motif 
of CCT subunits and the ATP binding motif of cAMP 
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dependent kinase and other kinases in this family. 
Amino-acid sequences around a consensus motif of CCT 
subunits (positions 102-115 in Figure 4) and the ATP 
binding motif of cAMP dependent kinase and related 
5 kinases [25,26] are compared. Amino acids conserved 
between these two groups are indicated by bold 
characters. (CCTor-SEQ, ID NO. 34, CCTjS-SEQ. ID NO. 35, 
CCTy-SEQ, id no. 36, CCT6-SEQ. ID NO, 37, CCTe-SEQ. ID 
NO. 38, CCTr-SEQ. ID NO. 39, CCT77-SEQ. ID NO. 40, CCT^- 
10 SEQ. ID NO. 41) cAPK-or, cyclic -AMP -dependent kinase or, 

(SEQ, ID NO. 42); PKC-a, protein ^kinase c-a (SEQ. ID NO. 
43) ; CaMII-Q?, bovine calcium- calmodulin-dependent kinase 
type Ilof, (SEQ. ID NO. 44) SNFl, a budding-yeast wild- 
type gene product for sucrose non-f emnenting mutant (SEQ. 
15 ID NO. 45) ; cdc2*, a fission-yeast cell-division-cycle 
gene product {SEQ. ID NO. 46); CDC7, a budding-yeast 
cell-division-cycle gene product (SEQ. ID NO. 47); Raf, a 
human cellular homologue of murine sarcoma virus product 
(SEQ. ID NO. 48) ; Src , a human cellular homologue of Rous 
20 avian sarcoma virus product (SEQ. ID NO. 49) ; Abl, a 
human cellular homologue of murine leukaemia virus 
product (SEQ. ID NO. 50); EGFR, human epidermal growth 
factor receptor (SEQ. ID NO, 51); INSR, human insulin 
receptor (SEQ. ID NO. 52); PDGFR, mouse platelet -derived 
25 growth factor receptor (SEQ. ID NO. 53) . 

Figure 6 shows hybridization analysis of the Cct 
genes of mouse and yeast. (a) Southern analysis of mouse 
Cct genes. Mouse 129/Sv liver DNA (10 ^g/lane) was 
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digested with Hindlll, electrophoresed on a 0.7% agarose 
gel and blotted onto a nylon membrane. The membrane was 
cut into 7 pieces and each of them hybridized with ^^P- 
labelled mouse Ccta/Tcp-1 (clone pTlbll) [33] , Cc tJb 
5 (pTS2) , Cctgr (pT77) , Cctd {pT62), Ccte (pTeS) , Cctz 
(pTfl2) and Ccth (pCBLSO) 1 . 5kb cDNA probes. These 
membranes were washed in 0 . Ix SSC supplemented with 0 . 1% 
SDS at 65 (b) Southern analysis for yeast homologues 
of Cct genes with mouse cDNA probes, Saccharomyces 

10 cerevisiae DNA (1-5 /zg/lane) was digested with Pstl. The 
DNA was electrophoresed, blotted and hybridized as 
described in panel (a) . After the hybridization, these 
membranes were washed in 2 x SSC supplemented with 0.1% 
SDS at 58.5°C. Positions of molecular weight markers are 

15 shown on the left side of each panel. 

Figure 7 shows the evolutionary tree of CCT subunits 
of eukaryotes and the homo- oligomer ic chaperonin of the:^ 
archaebacterium Sulfolobus shiJbatae, TF55. An 
evolutionary tree based on amino acid substitutions is 

20 constructed by the neighbour- joining method [50] with the 
aligned amino-acid sequences numbers 18-67, 70-157, 169- 
200, 213-241, 251-272, 274-310, 326-372, 376-388, 398-505 
and 521-566 (472 amino acids for each subunit) of mouse 
CCT subunits and archaebacterial chaperone TF55 in Figure 

2 5 4, and corresponding amino acid sequences of other 

homologues. Amino-acid sequences of human [48], fruit 
fly [51] , plant [2 9] and yeast [27] homologues of 
CCTa/TCP-1 were derived from previous publications. The 
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amino- acid sequence of the yeast homologue of CCTS was 
obtained from Miklos et ai . [28] . Capitals in 
parentheses indicate source species: MM, Mus musculus 
(mouse) ; HS, Homo sapiens (human) ; DM, Drosophila 
5 inelanograster (fruit fly) ; CE, CaenorhaJbdi tis elegans 
(nematode); AT, Arabidopsis thallana (plant); SC, 
Saccharomyces cerevisiae (yeast); SS, Sulfobus shibatae 
(archaebacterium) . 

Figure 8 shows full nucleotide sequences encoding 
10 subunits of a complex, in this case mouse CCT cDNAs . The 
polypeptide sequences of the predicted open reading 
frames are given under each DNA sequence. Figure 8(a) is 
the sequence of Tcp-la/Ccta (SEQ, ID NO. 54). Figures 8 
(b) (SEQ. ID NO. 55), (c) (SEQ. ID NO. 56), (d) (SEQ. ID 
15 NO. 57), (e) (SEQ. ID NO. 58), (f) (SEQ. ID NO. 59), (g) 
(SEQ. ID NO. 60) and (h) (SEQ. ID NO. 61) are the 
sequences of the Cctb, Cctg, Cctd, Gate, Cctz, Ccth and 
Cctq genes respectively. 

Figure 9 shows immunoprecipitation of CCT complexes 
20 from mouse F9 cells, precipitated under native conditions 
(14) using anti -peptide antibodies specific to CCTS, 
CCT7, CCTe and CCT17 . A negative control shows absence of 
signal using an anti -peptide antibody to 

Schizosaccharomyces pombe CCTor, which does not recognise 
25 any mammalian subunits. 

Figure 10 2D-PAGE analysis of CCT. Subunits of ATP- 
af f inity purified CCT from mouse testis were separated by 
2D- PAGE and proteins were visualised by silver staining 
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(A) . Greek letters indicate the eight subunit species 
whose genes have been cloned, and S6 shows the testis- 
expressed subunit. A novel co-purifying 63kDa protein of 
pi 6.93 is indicated by an arrow and hsp 70 proteins are 
5 shown by arrowheads, CCT subunits were immunoblotted 
with rabbit antibodies against carboxy terminal amino 
acid sequences of CCTfl (B) , CCTe (C) , CCT^S (E) , CCT7 (F) , 
CCTf (G) , CCTt? (H) , CCT6 (I), and a monoclonal antibody, 
91a, against CCTor (D) . Panel J shows CCT subunits 
0 immunoblotted with a rabbit antibody to a chaperonin 

consensus sequence thought to be involved in ATP-binding. 
Sequences of the peptide immunogens are shown in Table 2. 
In all panels the acidic side is to the left. 

Figure 11 Analysis of native populations of CCT. 
5 Partially purified CCT from mouse testis was subjected to 
non- denaturing isoelectric focusing followed by SDS- 
PAGE. Proteins were visualised by silver staining (A) . 
Native populations of CCT resolved by non -denaturing lEF 
are indicated by I and II and arrowheads show hsp 70 
) proteins. Native populations of CCT were immunoblotted 
with a rabbit antibody against CCTe (B) , a 3 0kDa 
proteolytic fragment of CCTe is indicated by an arrow, or 
a monoclonal antibody against /3- tubulin (C) . In all 
panels the acidic side is to the left. 
'> FIGURE 12 shows in vitro translation products 

produced from individual CCT cDNA clones. Analysis of 35 
S-labelled CCT subunits synthesized in rabbit 
reticulocyte lysate from individual CCT cDNA clones. 
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Aliquots of the in vitro translation reactions are 
electrophoresed on 10% SDS-PAGE and visualized by 
autoradiography. Lanes: (1) CCTa, (2) CCT^, (3) CCTy, 
(4) CCT6, (5) CCTe, (6) CCTf, (7) CCTt; , (8) CCTfl . M- 
5 molecular weight markers . 

FIGURE 13 shows detection of labelled CCT complex. 
Figure 13A shows an autoradiogram of a 10% SDS- 
polyacrylamide gel of 3 5S- labelled CCT proteins detected 
in fractions from a sucrose density grandient analysis of 
10 CCT assemblec in- vitro in rabbit reticulocyte lysate. 
Eight CCT subunit mRNAs, CCTa-CCT5 were translated 
together and the reaction mix applied to a 10%-40% 
sucrose gradient as previously described (14) . Lane M = 
marker proteins, lanes 1-14 contain 30 microlitre 
15 aliquots of 1ml sucrose fractions (fractions 17-4) 
Fraction 17 is the first, lightest fraction of the 
gradient. Free CCT subunits are observed in fractions 16 
and 15 (lanes 2 and 3) and the assembled complex is found 
in fractions 12 and 11 (lanes 6 and 7) . 
20 Figure 13B shows an autoradiogram of a 10% SDS- 

polyacrylamide gel of 3 5S- labelled CCT proteins 
immunoprecipitated from sucrose gradient fractions by 
anti-CCT^ antibodies. This analysis, demonstrates the. 
presence of assembled . CCT in sucrose fractions 12 and 11, 
25 but not in fractions 16 and 15 , Immunoprecipitation of 
aliquots of fractions from the same sucrose gradient as 
in Figure 13A. 300 microlitre aliquots of each of 
gradient fractions 17-9 were immunoprecipitated in a 1ml 
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reaction containing 5 microlitres of anti-CCT/S 
antibodies. Immune -complexes were recovered by binding 
to protein-A sepharose beads. Precipitation of fractions 
.16 and 15 with anti-CCT/S antibodies only recovers CCT/3 
5 from the mixture of 8 free, unassembled CCT subunits in 
the light fractions. However, precipitation of fractions 
12 and 11 recovers all subunits consistent with the 
notion that these sucrose fractions contain assembled CCT 
of expected size and sedimentation properties. 

10 

Sxunmary of materials and methods used in obtaining gene 
sequences . 

Puriflca,tion of CCT 

Mouse testis CCT was purified by sucrose gradient 
15 fractionation followed by ATP-agarose affinity 

chromatography as previously described [14] . Bovine 

testis TCP-1 ring complex (TRiC) [17] was a ]cind gift of 

F-U Hartl, Mouse F9 cells were labelled with ^^S-. 

methionine and ^^S-cysteine for 5h and CCT was purified by 
20 immunoaf f inity chromatography using anti-TCP- 1 monoclonal 

antibodies 84A and 91A [14,21] . Isoelectric focusing 

(lEF) [43] and non-equilibrium pH gradient 

electrophoresis (NEPHGE) [44] were carried out according 
to Corbet t and Dunn [4 5] . 
25 Peptide sequencing of CCT subunits 

CCT was subjected to electrophoresis on an 8% SDS- 
PAGE gel and protein bands were stained with 0.1% 
Coomassie Brilliant Blue in 0.5% acetic acid/10% aqueous 
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methanol and destained with 10% aqueous methanol. Bands 
, as shown in Figure la were excised and the proteins were 
electroeluted from the gel slices. The proteins were 
concentrated onto polyvinyl idene difluoride (PVDF) 
5 membranes using ProSpin cartridges (ABI) and each sample 
was digested with trypsin for 18h at 3 7°C using the 
method described by Fernandez et al, [46] . Digestion was 
terrninated with trif luoroacetic acid (TFA) and tryptic 
peptides were isolated from the PVDF supematants using 
10 an ABI 130A microbore separation system. TFA buffers 
(A: 0.1% TFA; B: 0.085% TFA, 70% acetonitrile) and a 
simple linear gradient (5 to 100% B in 50 min; 200 /xl per 
min) were employed on a 2 . 1 x 100 mm Brownlee Aquapore 
RP-30G column. Sequence analysis on the isolated 
15 fragments was performed using an ABI 477A protein 

sequencer with on-line PTH detection and data handling. 

Eight TRiC peptide sequences were published by 
Frydman et al . [17] and a further TRiC peptide sequence 
(P2) was a kind gift of F-U. Hartl . 
20 C. elegsLTis Tcp-1 related cDNA clones 

Nine Tcp-l related cDNA clones were reported from 5'- 
end single-pass sequencing of randomly selected C. 
elegans cDNA clones [22] . We compared these 9 partial 
sequences and found that they were derived from 4 
25 independent genes. Because the cDNAs were all 

direct ionally cloned, we subcloned the cDNA having the 
longest 5' region for each of the four genes (cmOBglO, 
cml2blO, cmlldB and cmlSeB) . These pBluescript KS* clones 
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were renamed pGl , pG2 , pG3 and pG4 and represent 1.8 )cb 
inserts for each of these four C. elegans genes. We 
accurately sequenced the 5' ends of these plasmid . 
subclones and deduced 90 amino acids of NH2- terminal 
5 sequence for each gene (Figure 2) . The deduced peptide 
sequences of pGl, pG2, pG3 and pG4 have 74%, 31%, 52% and 
48% identity to mouse TCP-1 [20] at their NH2 termini . 
Human Tcp-1 related cDNAs 

A cDNA library of the human cell line HT1080 (kindly 

10 given by P, Mitchell) contains cDNA inserted at the EcoRI 
site of bacteriophage lambda 2APII, These phages (1200 
pfus) were plated onto each of 24 plates and phage pools 
of each of them were made by incubating each lawn in 5 
mis of SM buffer overnight . Approximately 1 . 5 mis of 

15 pooled phage stock was obtained from each plate. A mixed 
primer for the peptide sequence TNDGATI (a motif 
conserved between TCP-1 and TF55) was designed (SEQ. ID 
NO. 62) : 

5 ' - AT (AG) GT ( AGT) GC (ACT) CCATC (AG) TT (AGT) GT- 3 ' . 

2 0 In and out PGR using the specific primer and a primer 

of the lambda ZAPII polylinker on the 5' side of the 
inserts was performed. PCR reactions (60 mM KCl, 15 mM 
Tris-HCl pH8.8, 2.25 mM MgCl2, 250 fxM each dATP, dGTP, 
dCTP and dTTP, 0.4 pmol//il each primer, 25 ^1/tube) were 

25 prepared and then 1 /xl of the phage stock and 1.5 unit 
Taq polymerase was added to each tube . These were 
subjected to 30 cycles of 93°C for 15 sec, 45*^0 for 15 
sec and 72 ^C for 3 0 sec. PCR products from 2 0 of the 
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pools gave one or only a few 150-3 00 bp bands by 1% 
, agarose gel electrophoresis. These bands were excised 
from the gel, reamplified and sequenced. Three Tcp-1 
related sequences were identified, subcloned into 
5 pBluescript KS* and named p383, p384 and phl3 . The 

deduced polypeptide sequence from p383 has 28% identity 
to mouse TCP-1 over its NHj- terminal 54 amino acids 
(excluding the primed sequence TNDGATI (SEQ, ID NO. 63)), 
that from p3 84 has 41% identity to mouse TCP-1 over its 

10 NH2- terminal 51 amino acids (excluding TNDGATI) and that 
from phl3 has 24% identity to mouse TCP-1 over its NH2- 
terminal 65 amino acids (excluding TNDGATI) . 

A partial cDNA clone, pAP3 , was isolated from an 
adult human kidney cDNA library by hybridisation with a 

15 PGR fragment generated using redundant primers 

corresponding to Na/K transporter ATPase sequences 
(Malik, manuscript in preparation) . Upon sequencing, 
pAP3 turned out to contain a 1050 base pair insert 
encoding a Tcp-1 related gene. Total mouse brain cDNA 

20 was subjected to PGR using two primers from the human 
pAP3 sequence : 

forward 5 ' - ACATCCAGCAGCTCTGTGAG- 3 ' ( SEQ , ID NO . 64 ) 

and 

reverse 5 ' -CCTTGCCTAGCACTCACTCC-3 ' (SEQ. ID NO. 65), 
25 An 800 bp fragment of the mouse orthologue of pAP3 gene 
was generated. 

Isolation of mouse Tcp-1 related cDNA clones 

The mouse full-length cDNA clones encoding 6 novel 
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TCP-1 related proteins were cloned as follows. An F9 
embryonal carcinoma cell cDNA library was made in lambda 
ZAP by the method of Nagata et al . [47] and 7.5x10^ size- 
selected recombinants were obtained from 1 /xg poly (A) * 
5 RNA. This library was transferred onto GeneScreen Plus 
(NEN) membranes and duplicate filters were sequentially 
probed with the following five Tcp-l related cDNA probes . 
The two C. elegans cDNAs, pG2 and pG3 , the three human 
cDNAs, p383, p384 and phl3 and the 800 bp mouse RT-PCR 
10 product from pAP3 were ^^P-labelled and hybridized to the 
membranes in Southern hybridization (SH) buffer (6x SSPE, 
5x Denhart's reagent, 0.5% SDS, 10 mg/ml salmon sperm 
DNA) overnight at 55 °C for the two C. elegans probes, 
60°C for the three human probes, and 65^0 for the mouse 
15 probe. Between 6-12 positively hybridizing clones were 
purified for each probe. The longest clone (1.8-2.2 kb 
inserts) derived from each screening was named as pTfi2.. 
(p383 homologue) , pTe5 (p384 homologue) , pT01 {phl3 
homologue) , pTri2 (pG2 homologue) , pT52 (pG3 homologue) 
20 and pTyl (pAP3 homologue) . The relationship between 

these probes and clones is described in Figure 2 . The 
sixth mouse Tcp-1 related cDNA was recovered during a 
pilot experiment for a mouse testis cDNA sequencing 
project. Ninety-five random mouse testis cDNAs 
25 directionally cloned in lambda ZAPII were single pass 

sequenced from their 5' ends. The sequence of the 80th 
cDNA revealed a new Tcp-I related gene which is different 
to all the others described above and this plasmid. 
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pCBLSO, was completely sequenced. 
DNA sequencing and computer analysis 

The nucleotide sequences of all the Tcp-1 related 
genes, except pCBLSO, were determined by the 
5 dideoxynucleotide chain termination method with fluor- 
escent ly labelled primers using the PRISM kit and a 3 73A 
automated sequencer (ABI) . Clone pCBL8 0 was manually 
sequenced by dideoxynucleotide chain termination and ^^S- 
dATP using a combination of deletion clones and specific 

10 primers. All these sequences and deduced amino-acid 

sequences were analysed by UWGCG programs on the Silicon 
Graphics Crimson network at the SERC, Daresbury, UK. 
Phylogenetic analyses were also carried out at Daresbury 
using the PHYLIP collection of programs. 

15 Southern blotting 

Mouse strain 129/Sv DNA (10 /xg/lane) and S. 
cerevisiae strain 3a DNAs (1.5 /xg/lane) were digested 
with Hindlll and electrophoresed on 0.7% agarose gels and 
blotted onto GeneScreen Plus membranes. Probes of 1500 ± 

20 3 bp length were made by PCR with the mouse Tcp-1 cDNA 

(pTlbll) [33] and mouse Tcp-l related cDNAs (pTS2, pT 7, 
pT62, pTeS, pT 12 and pCBL8 0) . These 7 probes were 
labelled with ^^P and hybridized to the blotted membranes 
in SH buffer overnight at 65*=*C (mouse) or at 55 °C 

25 (yeast) . The membranes were washed either in 0 . Ix SSC 
containing 0.1% SDS at 65°C (mouse) or in 2x SSC 
containing 0.1% SDS at 58.5°C (yeast). 
Results 
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Suhunit composition of the chaperonin containing- TCP~1 
(CCT) 

Two-dimensional gel analysis of the subunits of the 
CCT reveals 7-9 spots of 52-65 JcDa. Figure 1 shows SDS- 
5 PAGE, 2D- PAGE and 2D-NEPHGE comparison of CCT isolated 
from mouse testis and F9 embryonal carcinoma cells, and 
bovine testis TCP-1 ring complex (TRiC) [17] . 

Excluding the Hsp70 species which co-purify 
specifically with the CCT [14] , the testis preparations 

10 contain 9 major species (Figure Ib-d) and the F9 cell 

preparation contains seven 53-65 JcDa species excluding a 
4 5 IcDa protein (Figure le) . This represents an increase 
in the number of species reported by Lewis et ai . [14] 
because two species (S8 and" S9) , with basic pis, resolve 

15 poorly in the lEF dimension of 2D- PAGE run to 

equilibrium. A 2D-NEPHGE analysis of bovine TRiC clearly 
shows 9 subunit species (Figure Id) . 

We had previously shown, using a monoclonal antibody 
OlA) [21] which recognizes only a single subunit (S3, 

20 TCP-1) of the CCT, that antibody-affinity purified CCT 

has a similar electrophoretic profile to CCT purified by 
ATP-agarose affinity or ion-exchange chromatography, on 
SDS-PAGE gels [14] . Figure le shows a 2D-NEPHGE analysis 
of antibody-affinity purified, CCT from F9 cells; the 

25 polypeptide spot profile strongly resembles CCT 

biochemically purified from testis (Figure la,b) and 
seven 52-65 kDa species are distinguishable by the 2D 
analysis. A 45 kDa protein of pi 6.5 was co-purified 
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with CCT by the antibody-af f inity method, suggesting a 
novel CO- factor or substrate of CCT (arrow head in Figure 
le) . These results confirmed that TCP-l comprises a 
hetero-oligomeric complex with other polypeptides. 
5 Figure If shows a tabulation of the apparent 

isoelectric points and molecular weights of the CCT 
subunits. Frydman et al. [17] reported that bovine 
testis TRiC migrated as 5 bands on a 12.5% SDS-PAGE gel 
(P1-P5) . We found that band P4 contained 5 species of 
10 polypeptides which were distinguishable from one another 
on a 2D gel, and we named them P4.1, 4.2, 4.3, 4.4 and 
4 . 5 according to ascending pi . The subunits of mouse CCT 
(S1-S9) are numbered according to ascending pi using the 
mouse testis pattern as the standard- These bovine and 
15 mouse spots are distributed from pH6 . 1 - pH7.1 and MW 65 
kDa - MW52 kDa except for the most basic spot (NEPHGE 
analysis suggests a value of around pH 7 . 4 for the pi of 
this most basic spot) , 

Isolation of mouse cDNAs encoding seven novel TCP-l 

2 0 related proteins 

We previously reported [14] the existence, in various 
eukaryotes, of numerous genes and partial cDNA sequences 
related to the original mouse Tcp-1 gene [20] , the first 
identified gene encoding an CCT subunit [14,17] . It 

25 proved necessary to use a combination of methods to 

isolate murine Tcp-l related genes and clone the full- 
length cDNAs which encode the seven novel TCP-l related 
proteins described herein. The first and second 
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(clones pT62 and pTfl2) were isolated by cross- 
hybridization with Caenorhahditis elegans cDNA probes 
recovered from the 5' expressed sequence tag collection 
of Waterston et al. [22] , 
5 The third, fourth and seventh (clones pTS2, pTeS and 

pT^l) were isolated by cross -hybridization with human 
probes recovered by PGR of human HT1080 cell line cDNA 
using degenerate primers from a conserved region of TCP-1 
and TF55. 

10 The fifth gene (clone pCBLSO) was isolated during a 

mouse testis cDNA sequencing project at the Chester 
Beatty Laboratories, London. A cDNA fragment of a Tcp-l 
related gene (clone pAP3) was recovered accidentally from 
a human kidney cDNA library ' during a screen for ion 

15 transport channel genes (Malik et al . , manuscript in 

preparation) and the sixth murine gene (clone pT77) was 
recovered by hybridization with a mouse cDNA PGR product 
which was made with primers derived from the sequence of 
this human cDNA. 

2 0 Figure 2 shows the very conserved NH2- terminal domain 

deduced from these various Tcp-l and Tcp-l related genes 
of mouse, human, C. elegans and Saccharomyces cerevislae. 
A phylogenetic analysis (UPGMA) [23] based on the amino- 
acid sequences of the domain showed seven groups of genes 

25 (data not shown) , suggesting that in eukaryotes Tcp-l and 
the Tcp-l related genes have each evolved independently 
since they diverged. Analysis of the eight members of 
the Cct gene family suggests that the heteromeric CGT 
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particle had already evolved in a common ancestral 
organism of animals, plants and yeast more than 500 
million years ago and that different subunits may have 
different functions in protein folding,. 
5 Figure 3 and Figure 8 show amino-acid sequences of 

the seven mouse TCP-l related proteins deduced from their 
full-length cDNA clones. 

Figure 8 (b) to (h) shows the DNA sequences of the 
seven genes and the amino acid sequences (of the deduced 
10 open reading frames) underneath. 

The seven Tcp-l related cDNAs encode CCT suhxiaits 

Internal tryptic peptides derived from an SDS-PAGE 
gel separation of mouse testis CCT (B1-B4, Figure la) 
were sequenced because the NH2 termini of every subunit 
15 appeared to be blocked yielding no sequence data. 

Firydman at al . [17] obtained some internal peptide 
sequences from four of the bovine testis TRiC subunits 
(PI, P3, P4a/TCP-l and P5) and they also made available 
to us unpublished peptide sequences which we show here 
20 correspond to TRiC subunit P2 . Comparisons between these 
mouse and bovine peptide sequences and the predicted 
peptide sequences of the seven Tcp-1 related cDNAs 
reveals that they each encode particular CCT subunit 
proteins (Figure 3, Figure 8 (h) , Table 1). We named 
.25 these 7 new genes as Cctb (pTS2) , Cctg (pT'y7) , Cctd 

(pT62) , Ccte (pTe5) , Cctz (pTfl2), Ccth (pCBLSO) and Cctq 
(pT01)and their encoded proteins as CCT S, 7, 6, 77, 
and e subunits respectively. We suggest renaming the 
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Tcp-1 gene [20] as Cctcx and the TCP-1 protein as CCTor. 
Table 1 summarizes the correspondence between the 
proteins encoded by Cct genes and the spots of CCT 
subunits separated by 2D-gel electrophoresis (Figure 1) . 
5 Similarities among CCT suhunit polypeptides 

Table 1 also shows the lengths in amino -acid number 
and molecular weights of the CCT subunits. They have 
similar characteristics despite the fact that they are 
only around 3 0% identical to one another in amino-acid 
sequence (Table 2) . They vary slightly in length from 
531-556 residues and in predicted molecular weight from 
57456-60636 Da. These predicted molecular weights are 
approximately consistent with those determined by SDS- 
PAGE (Figure 1) . 
15 The pis of the subunits, experimentally determined by 

isoelectric focusing, correlate well with the total 
charge value of each predicted CCT subunit polypeptide - 
assuming some histidine residues are charged as cations 
at their respective pis (data not shown) . The percentage 
20 of hydrophobic and charged amino acids in each CCT 

subunit is highly conserved, ranging from 31.6-33.5% and 
23.9-27.3% respectively. 

These conserved chemical properties probably reflect 
common functions of each mammalian subunit and are shared 
25 with the archaebacterial chaperonin TF55 . Agard [24] 
suggested that hydrophobic interaction between the E. 
coli chaperonin GroEL and its substrates could be 
important for the folding process. The conserved 
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percentage of hydrophobic amino acids in CCT subunits may 
thus be important for interaction with substrates. 
,. Hydrophobicity profiles of CCT subunits suggest conserv- 
ation of hydrophobic and hydrophilic amino-acid 
5 distributions among CCT subunits, especially at the NHj 
terminus (data not shown) . 

Figure 4 shows an alignment of all eight CCT 
sequences and TF55. The alignment of the peptide 
sequences show six major gaps (Nos. 164, 201, 246, 315, 
10 392 and 508 in Figure 4) and thus indicates seven blocks 
of homology divided by the gaps. Amongst these seven 
blocks, the first (Nos. 1-164) and fifth (Nos. 392-508) 
are the largest and contain some highly conserved motifs : 
LGPKGMDKM (Nos. 52-60) (SEQ." ID NO. 66), TITNDGATIL (Nos. 
15 71-80) (SEQ. ID NO. 67), QDDEVGDGTTSW (Nos . 100-112 ) 
(SEQ. ID NO. 68), ERSLHDAL (Nos . 423 -430 ) (SEQ. ID NO. 
69), and W(A/P)GGGA (Nos. 442-448) (SEQ. ID NO. 70). 
The third motif includes a motif absolutely conserved in 
CCT subunits, GDGTT (SEQ. ID NO. 71), previously 
20 recognized by Lewis et al . [14] to be homologous to a 
nucleotide phosphate binding domain of cyclic-AMP- 
dependent kinase and other members of this kinase family 
[25,26]. Figure 5 shows a comparison between these 
motifs. This suggests that all the CCT subunits share 
25 the common function of ATPase activity. 

Structural genes encoding CCT subunits and homologous 
genes in yeast 

Southern analysis of mouse genomic DNA (Figure 6a) 
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shows the independent structural genes encoding seven 
mouse CCT subunits. Hybridization of Sac char owyces 
cerevisiae DNA with seven mouse Cct cDNA probes (Figure 
6b) shows one or two bands of Pstl-digested DNA for each 
5 probe. Two Tcp-l related genes have already been 

identified in yeast; the orthologue of Ccta/Tcp-l [27] 
and the orthologue of Cct±> [28] . Both these genes are 
essential [27,28] and temperature sensitive mutations in 
both genes affect microtubule mediated processes [28] . 
10 The CCT of S. cerevislae is composed of numerous subunits 
and the yeast homologues of CCTor/TCP-l and CCTS are 
components of the same complex since they co-purify 
together [28] , Preliminary sequence data derived from 
yeast DNA clones corresponding to the novel sequences 
15 detected with the mouse Cct probes suggests that they are 
indeed the orthologues of the mouse Cct genes. Southern 
analysis of Arahldopsis genomic DNA with the seven Cct 
cDNA probes also suggests the existence of plant 
homologues for each siibunit {data not shown) ; the 
20 Arahidopsis homologue of Ccta/Tcp-l has been reported 
[29] , It is lilcely that these seven species of CCT 
subunits are ubiquitous in all eulcaryotes . 
Determination of the subunit compositions and 
arrangements in CCT 
25 The sequence of the eight CCT subunits reveals that 

they are rather divergent at the C-terminii. A 
monoclonal antibody, 23C, had previously been made (21) 
which reacts with the C- terminus of CCTalpha (Harrison- 
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Lavoie et al, 1993, EMBO J. 7: 2847-2853) and can be used 
to precipitate CCT complexes. Subunit specific 
. antibodies have now been made, utilising the sequence 
information provided by the invention. The antibodies 
5 were raised against C-terminus peptides with the 
following sequences: 

CCTbeta: APRKRVPDHHPC (SEQ. ID NO. 1) 

CCTgamma: NRQTGAPDAGQE (SEQ. ID NO. 2) 

CCTdelta: SILKIDDWNTR (SEQ. ID NO. 3) 

10 CCTepsilon: IDDIRKPGESEE (SEQ. ID NO, 4) 

CCTeta: SAGRGRGQARFH (SEQ. ID NO. 5) 

CCTtheta: SGKKDWDDDQND (SEQ. ID NO. 6) 

CCTzeta: EIMRAGMSSIiKG (SEQ. ID NO. 7) 

The peptides were synthesised by conventional solid phase 
15 chemistry on a commercial peptide synthesizing machine. 

An amino terminal cysteine was added to the peptides and 
used to couple them to purified protein derivative (PPD) 
solid supports. 10 milligrams of purified peptide was 
coupled to 9 milligrams of PPD and the conjugate was 
20 purified twice by column chromatography to remove 

unconjugated impurities. Rabbits were immunised over a 
96 day period. Antibodies were screened and isolated 
using their binding for the peptides, using routine 
methods (reviewed in Harlow, E and Lane, D (1988) , 
25 Antibodies: a laboratory manual , CSH Press, NY.). 

Antibodies specific for individual subunits were 
obtained. The antibodies proved to be specific for both 
mouse and human subunits . 
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Immunoprecipitation performed under native conditions 
(BOniM Hepes pH8 , 90TnM KCl, 0.5% TX-lOO) with these 
subunit specific antibodies shows that CCT consists of a 
complex mixture of particles. Different antibodies 
5 immunoprecipitate different combinations of subunits (for 
instance, see Figure 9) . Examination of different 
immunoprecipi tat ions allows the determination of the 
composition of CCT in any cell type. This information 
enables the intelligent selection of subunit combinations 
to be used in in vitro construction of CCT, for instance 
by means of recombinant expression of subunits and their 
assembly. 

Antibody characterisation 

2D-PAGE Mouse testis CCT was purified by sucro'se 
15 gradient fractionation followed by ATP-affinity column: 

chromatography as previously described [14] . Isoelectric 
focusing (lEP) was carried out according to [45]; and > was 
followed by SDS-PAGE on 8% gels and either silver 
staining or electrotransf er of proteins to nitro- 
20 cellulose, immunoblotting and detection by the ECL system 
(Amersham) as previously described [14] . 
Non- denaturing- isoelectric focusing 

Mouse testis CCT was partially purified by sucrose 
gradient fractionation according to [14] . The fraction 
25 corresponding to 20% sucrose was mixed with an equal 

volume of sample buffer (4 0% sucrose w/v, 2% Ampholytes 
[Resolyte 4-8, BDH] ) and subjected to non-denaturing 
• isoelectric focusing at 500V for 4 hours. The non- 
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denaturing lEF was performed in the same way as 
denaturing lEF described by [45] with the substitution of 
, 40% sucrose for 50% urea and the omission of 3-[{3- 
cholamidopropyl) dimethylammonio] -1-propanesulfonate 
5 (CHAPS) in the acrylamide gel mixture. Non- denaturing 

lEF was followed by SDS-PAGE on 8% gels and either silver 
staining or electrotransf er of proteins to nitro- 
cellulose, immunoblotting and detection by the ECL system 
(Amersham) . 

10 As shown in Figure 10, mouse testis CCT contains nine 

subunit species (Sl-SO by 2D-PAGE analysis (Fig. lOA and 
Table 3). Tcp-l and seven Tcp-l related genes encoding 
the or, (3, y, 5, e, ^, 77 and 6 subunits of mouse CCT have 
been cloned [52] ; the testis-expressed subunit, S6, and a 

15 co-purifying 63lcDa protein (arrowed in Fig. lOA) which 
may be a further subunit of mouse CCT remain to be 
cloned. We have obtained DNA sequences of S6 and S7 and 
have established that they are encoded by two closely 
related CCTf genes. We expect that S6 and S7 can 

20 substitute for each other in CCT and may confer some 
tissue specificity in function of CCT. 

A previously characterised monoclonal antibody, 91a 
[41,52], was used to detect CCTof (Fig. lOD) . Six 
antibodies recognising single subunit species, BC-1, GC- 

25 1, DC-1, EC-1, TC-1 and THC-2 were produced, which 
reacted specifically with the /3, Y, 6, e, 77 and 6 
subunits of CCT respectively (Fig. lOB, IOC, lOE, lOF, 
lOH and 101) . In addition to CCTf the antibody ZC-1 also 
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reacted with the testis -expressed subunit 36 (Fig. lOG) , 
which indicates that these two subunits S6 and S7, have 
related carboxy terminal sequences and this supports the 
DNA sequence data that they are encoded by closly related 
5 genes, A polyclonal rabbit antibody, UM-1, was made to 
an amino terminal consensus motif which is highly 
conserved between all chaperonin sequences and is 
proposed to be involved in ATP binding and hydrolysis 
(Table 3) [52,53] . UM-1 reacted with all nine subunits 
10 of mouse testis CCT including the testis-expressed 
subunit 36. CCT/S, CCTe , CCTy, CCTT and 36 were 
recognised strongly (Fig. lOJ) , which CCTa, CCT5 , CCT77 
and CCTfi were recognised weakly (not shown in Fig, 10 J 
but seen on a longer exposure) . In addition to the nine 
15 subunits described above UM-1 strongly reacted with the 
co-purifying 63kDa protein, which suggests that this 
protein could be an additional subunit species of mouse 
testis CCT or alternatively a modified isoform of a 
previously identified CCT subunit. 
20 Resolution of native populations of CCT 

Partially purified CCT from mouse testis germ cells 
resolves into two distinct populations, I and II, . when 
subjected to non-denaturing isoelectric focusing 
(Fig. 11) . These populations were further analysed in a 
25 second dimension by SDS-PAGE and visualised by silver 
staining (Fig.llA) or immunoblotting with specific 
antibodies (Fig. IIB and IIC) . A striking difference 
between the two populations is that a large number of 
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polypeptides, in addition to subunits of the CCT complex, 
are associated with II whilst I has only a few co- 
migrating species (Fig. liA) . It is likely that many of 
these associated polypeptides are substrates for folding 
5 by CCT and that II has a high affinity for binding 

polypeptide chain substrates and I has a low affinity. 
To date, only tubulin and actin have been established as 
substrates for folding by CCT in vivo [15,18,16] and both 
^-tubulin (Fig. iic) and actin (data not shown) are found 
10 associated with II but not with I. 

Arrowheads show hsp 70 proteins which resolve away 
from CCT (Fig. IIA) , possibly due to the high pH 
conditions during sample loading. Small amounts of hsp 
70 proteins co- immunoprecipitate with CCT under native 
IS conditions [14,52] suggesting that hsp 70 is found 

associated due to its interaction with polypeptide chains 
undergoing folding on CCT. Therefore, this technique is 
a stringent analysis of protein interactions and a 
reflection of the high affinity of CCT form II for 
20 substrates. 

Fig. IIB shows CCT populations I and II immunoblotted 
with a polyclonal rabbit antibody to CCTe, however both 
populations contain all nine subunits of mouse testis CCT 
as demonstrated by immunoblotting with the antibodies 
25 described above (data not shown) . Since resolution of 
native lEF markers in this system demonstrates that 
separation is on the basis of charge and not size (data 
not shown) , we propose that I and II represent two 
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distinct conformations of hetero-oliogmeric CCT which are 
resolved under these conditions due to different 
complements of surface charges being exposed in each 
conformation . 

5 During the biochemical extraction of CCT from mouse 

testis and from other sources, we often observe limited 
proteolysis of all the subunits in preparations of intact 
900kDa CCT complex. A 30kDa fragment of CCTe (arrowed in 
Fig. iiB) is detected only in form II. This differential 
susceptibility to proteolysis indicates that the cleavage 
site is more accessible in II, which is supporting 
evidence that I and II adopt different conformations. 
The size of these proteolytic fragments are between 28 to 
30 kDa suggesting that the protease sensitive sites 
reside in the putative apical polypeptide chain binding 
region of each CCT subunit [53] . 
Discussion of antibodies 

Antibodies specific for each subunit species will be 
useful in investigating the combination and arrangement 
of subunits in the CCT complex and the antibody, UM-1, 
which recognises all the subunits can be used as a 
general reagent to identify CCT from other eukaryotes. 
Characterisation of mouse testis CCT revealed that the 
testis-expressed subunit, S6 [52], is related to S7 and 
that a co-purifying 63kDa protein may be a novel subunit 
of mouse testis CCT. The 63 kDa protein is probably 
encoded by a novel Tcp-I-related gene. 

A problem with the biochemical analysis of large 
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molecular weight complexes such as CCT is in resolving 
different states even when these states differ by the 
addition of accessory polypeptides, such as substrates or 
CO- factors. Herein described a non-denaturing 
5 isoelectric focusing technique that facilitates the 
resolution of two forms of CCT. One form is bound to 
many other polypeptides and is susceptible to proteolysis 
whilst the other form is bound to only a few other 
polypeptides and is resistant to proteolysis. 
10 These results suggest that CCT in the cell may adopt 

two distinct conformations with different affinity for 
polypeptide chain substrates. The conformation with high 
binding affinity for substrates may expose a domain which 
is susceptible to proteolysis in each CCT subunit, and 
15 this site may be located within the sxibstrate binding 

domain since it is in a similar position to the substrate 
binding domain of GroEL [54,55], In GroEL, dramatic 
structural changes have been associated with ATP binding 
[5,56] . One interpretation of our data is that binding 
2 0 or hydrolysis of ATP acts as a switch between 

conformational forms of CCT that interact strongly or 
weaJcly with polypeptide chain substrates. The wide range 
of polypeptides found associated with CCT suggests that 
the physiological substrates for CCT may not be limited 
25 to actin and tubulin and implies that CCT has a general 
role in protein folding in the eulcaryotic cytosol . 
Production and assembly of polypeptide components of a 
folding con^lex 
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(1) In order to create a complex able to fold a 
polypeptide, messenger RNA dependent rabbit reticulocyte 
lysates are used. Synthetic, capped mRNAs encoding each 
of the CCT siibunits to associate are independently 
5 synthesised in vitro by RNA transcription^ of each 

linearised Bluescript™ plasmid containing a full-length 
coding CCT sequence under the appropriate control 
sequences. (Sambrook et al, 1989, Molecular Cloning: a 
Laboratory Manual, CSH Press, New York) These RNAs are 
10 added to rabbit reticulocyte lysate containing ^^S- 

methionine for synthesis of ^^S-lafaelled CCT subunits, 
which facilitates detection on native electrophoretic 
gels or sucrose density gradients. The presence of 
particular subunits is confxrmed also using subunit- 
15 specific antibodies, by immunoprecipitation. 

Synthesis of subunits together leads to the assembly 
of a complex whose ability to bind to and/or fold a 
polypeptide can be tested. 

Translating mRNA encoding a substrate polypeptide to 
20 be folded, such as actin or tubulin, with the subunits 
enables detection of complexes of labelled CCT and 
substrate. Labelled substrate facilitates this. 
Although rabbit reticulocyte extracts contain endogenous 
CCT, this does not cycle with the newly synthesised CCT 
25 subunits. 

(2) The rabbit reticulocyte system of (1) is used to 
investigate different combinations of subunits to assess 
their ability to associate and form a complex able to 
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bind and/or fold a polypeptide of interest. For example, 
an assembled complex of translation products of mRNAs 
encoding eight subunits (SI, S2, S3, S4, S5, S6 or S7, S8 
and S9) is precipitated using an antibody specific for 
5 any one of the subunits (Figures 13 and 14) and its 

binding to a substrate also translated in vitro, such as 
tubulin, is testable (16) . 

(3) Subunits are expressed in E. coli using appropriate 
vectors. The combination of subunits chosen for a 
particular purpose, ie folding of a particular 
polypeptide, is made in large quantities. The subunits 
associate to form heteromeric structures of expected 
sedimentation properties and molecular weight. Subunits 
may be expressed individually and then purified, with the 
15 assembly carried out in vitro by mixing CCT subunits in 
the appropriate combinations. 

(4) For large scale production, subunits are expressed 
using Baculovirus. A nucleotide sequence encoding a 
subunit is inserted into a suitable expression vector and 
operably linked to control elements within that vector. 
The vector 'and wild- type viral genome are transfected 
into an insect host cell where the vector and viral 
genome are allowed to recombine to form a recombinant 
baculovirus expression vector. The baculovirus vector is 
packaged into infectious recombinant baculoviruses . Large 
quantities of subunits are produced which are able to 
associate to form complexes able to fold a polypeptide 
substrate . 
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(5) Folding of a polypeptide substrate, for instance 
tubulin, actin or lucif erase,' is tested by a standard 
technique as described in Frydman et al (17) . See also 
references (4) , (15) and (16) . 
5 Assenbly of mouse CCT from 8 recombinant cDNA clones. 

The purpose of these experiments is to show that the 
8 CDNA clones encoding the CCTa, /3, y, 6, e, f, and 
subunits are necessary and sufficient to produce 
recombinant CCT proteins capable of co-assembling into 
the heteromeric core-CCT complex which we expect from our 
biochemical and genetic analysis to be constructed from 
these 8 subunits. We have demonstrated this in the 
following way. Plasmids containing T3 or T7 promoters 
upstream of the CCT cDNA open reading frames were 
15 linearized with appropriate restriction endonucleases to 
produce templates for the synthesis of each CCT mRNA by 
T3 or T7 RNA polymerase. The plasmids which we use are • 
CCTof/pTieil, CCT^/pT/32, CCTr/pT-yV, CCT6/pT62, CCT6/pT£5, 
CCTf/pTfl2, CCT77/PT7729 and CCTfi/pT9l. 

When rabbit reticulocyte lysate was programmed singly 
with each CCT subunit encoding cDNA a 35S-labelled 
polypeptide of expected molecular weight was synthesized 
(Figure 12) . 

To show assembly, the lysate was programmed with all 
25 eight subunit encoding plasmids together. After 

synthesis the reaction mix was applied to a 10-40% 
sucrose gradient in order to resolve the assembled CCT 
complexes from the free subunits as shown previously by 
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US (Reference 14) , Labelled CCT complex can be detected 
at 20% sucrose density (Figure 13A) and all the subunits 
are immuno-precipitated using antibodies specific to 
single subunits (Figure 13B) . This demonstrates the 
5 principle of co-assembly from the 8 subunit genes. This 
process is repeatable using subunit proteins expressed in 
mammalian tissue culture cells, bacteria and yeasts. 
Yeast expression 

The Plschla, pastoris heterologous gene expression 
10 system has been used to express individual CCT subunits 

in yeast. Specifically we inserted the CCTe open reading 
frame into pHIL-D2, This construct was used to transforro 
Plschia producing strains expressing CCTe under the 
control of the AOXl promoter. These strains, such as 
15 WIL5, were shown to produce CCTe in response to induction 
by methanol, both using small scale cultures (to measure 
time course of induction) and subsequently by large-scale 
fermentation. In order to follow expression of mouse 
CCTe in Plschia pastorls in these experiments anti-CCTe 
20 antibodies disclosed herein were utilised. Without the 
antibodies the experiments would not have been possible. 
All procedures were according to the In Vitrogen manual 
and as described in Cregg, J.M. et al . , 1993 
Bio/Technology, Vol. 11, 905-910, 
25 Expression in human cells 

CCT subunits were expressed in human cells as free 
subunits and incorporated into endogenous CCT complex 
using standard eukaryotic expression vectors such as 
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pcDNAl. Specifically, the CCTo; and CCT/3 open reading 
frames were subcloned into pcDNAl giving plasmids pEXor 
and pEX/3. These plasmids were transfected into human 
293T cells and CCT protein expression was measured by 
5 Western blotting and immunoprecipitation using antibodies 
disclosed herein. Substantial expression of CCTor and 
CCT^ was obtained. 
Additional Discussion 

The symmetry of each ring of the CCT double -torus 

10 particle is yet to be determined, although EM-negative 
stain analysis suggests that it is likely to comprise 8 
or 9 subunits [14] , Although the stoichiometric 
composition of CCT with respect to individual polypeptide 
species has not yet been determined biochemically, we 

15 have confirmed that there are seven species of CCT 
subunits by 2D-NEPHGE analysis of antibody-affinity 
purified CCT purified from mouse F9 cells. We have been 
able to assign eight CCT subunit proteins to eight Cct 
genes including Tcp-l/Cctot. The analysis of CCT [14] and 

20 the actin chaperonin [15] by negative stain electron 

microscopy is consistent with the heteromeric nature of 
the CCT since the particles appear quasi -symmetrical with 
different sized/shaped subunits. Lewis et al . [14] 
previously showed that mouse testis CCT. contained more 

2 5 subunits than human HEp2 cell CCT and Roobol and Carden 
[19] found differences between brain CCT and testis CCT 
derived from rats and guinea-pigs. These data are 
consistent with the idea that a particular CCT particle 
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is composed of several kinds of subunits, although some 
subunits may be replaceable depending upon tissue type. 

The eight genes (the seven new ones plus the original 
TCP-1 (alpha) ) described herein are sufficient to 
5 reconstitute CCT. Certain cell types such as mouse F9 
and human HEp-2 contain abundant levels of 7 CCT 
subunits, with the S2 subunit, encoded by the CCTe gene, 
appearing to be absent, based on protein analysis but it 
may be rapidly turning over or post- translationally 
10 modified causing levels of S2 to be difficult to measure. 
The role of the mouse testis expressed subunit, S6, is 
unclear but it probably has an exchangeable function with 
SI , the product of the CCTzeta. gene, given that 
antibodies raised against S7 react with S6 and that we 
15 have DNA clones for two closely related CCTf genes. 
Northern blotting and protein analysis lead to the 
conclusion that SI, S2 , S3, S4, S5, S7, S8 and S9 are 
ubiquitously expressed and therefore that they constitute 
the eight essential components of general CCT activity. 
2 0 One can build/reconstruct CCT folding machines 

(complexes) using combinations of these seven components. 
Specialised proteins may need the presence of an 
additional, eg tissue specific, subunit or other 
cof actors to be correctly folded. However, these should 
25 be able to be incorporated into CCT machines in 
combination with the subunits provided herein. 
Evolutionary origin of CCT suhunit genes 

All organisms are classifiable into three primary 
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kingdoms, eubacteria, archaebacteria and eukaryotes, 
based upon ribosomal RNA sequences [3 0] . Several 
proteins found in archaebacteria and eukaryotic cytosol 
are similar to each other and those of mitochondria, 
5 chloroplasts and eubacteria are also similar to each 
other [31] . 

Our sequence analysis has shown that each CCT subunit 
is very similar to TF55 of an archaebacterium Sulfolobus 
shibatae (Figure 4, Table 2) and only weakly related to 
10 GroEL and other chaperonin proteins which are thought to 
have evolved from a common eubacterial origin [10] . 
Another archaebacterium, Pyrodictium occultum, also has 
an ATPase complex whose tryptic peptides share 70% 
identity with TF55 [32] . THese results suggest that all 

15 the CCT genes evolved from a common ancestor of 

eukaryotes and archaebacteria. Each eukaryotic Cct gene 
is as divergent from another Cct gene as it is from TF55 
and this suggests that the eukaryotic genes diverged from 
one another very early in the eukaryotic lineage. An 

20 evolutionary tree of CCT subunit s and TF55 based on 

amino-acid substitutions supports these ideas (Figure 7) , 
If the amino-acid substitution rate of each CCT subunit 
is constant during evolution [3 3] and assuming that 
yeasts and animals diverged 1000-1200 million years ago, 

25 then we calculate 1800-2400 million years for the 
divergence times of CCT subunits. 

However, the orthologues of each CCT member in 
animals, plants and yeasts are much more similar to one 



wo 95/20654 PCT/GB95/00192 

50 

another and this suggests that each subunit has evolved 
an independent function a long time ago which has been 
maintained in most eukaryotes . Since CCT is composed of 
7-9 subunit species and other chaperonins have only one 
5 or two subunit species, it seems likely that the CCT has 
evolved more complex functions in eukaryotic cytosol . 
The increased complexity of CCT may have facilitated the 
evolution of the highly organized eukaryotic cytosol by 
co-evolution between CCT and substrates. 
10 Function of the CCT 

CCT, TF55 and classical chaperonins (GroEL, Hsp60 and 
Rubisco binding protein) all have ATPase activity. It 
seems likely that the conservation of sequence in this 
family reflects the maintenance of ATPase activity rather 
15 than the ability to form double torus structures which 
many unrelated proteins, such as glutamine synthetase 
[34] can adopt. In GroEL the conformation of the 
particle changes by rotation of the GroEL subunits 
relative to the perpendicular axis upon binding ATP [5] . 
2 0 EM analysis of the act in chaperonin in the presence and 

absence of ATP suggests that similar large conformational 
changes may occur [15] . There are three highly conserved 
motifs among the TCP-1, TF55 and GroEL family chaperonins 
(see Figure 4 in [14]). Two motifs, 
25 (I/V)T(N/K)DG(A/V) (T/S) (SEQ. ID NO. 72) and GDGTT(S/T) 
(SEQ. ID NO. 73), are positioned at the NH2 terminus and 
one, V(A/P)GGG (SEQ. ID NO. 74), towards the COOH 
terminus. These three motifs are also conserved in the 
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other CCT subunits and interestingly the amino-acid 
sequences around GDGTT{T/S) (Figure 5) have homologies to 
the phosphate binding domains for ATP hydrolysis shared 
by the cAMP- dependent kinase family proteins [25,26] . If 
5 these widely separated motifs are indeed involved in ATP 
binding and hydrolysis, then they may be responsible for 
the large conformational changes the chaperonins make 
upon ATP binding. This idea is supported by our 
knowledge of the structure of cAMP- dependent protein 
10 kinase-a [26], Hsp70 [35,36] and actin/myosin [37]; 

several domains required for ATPase activity are widely 
separated in the amino-acid chains of these proteins. 

One possible reason for the multiplicity of CCT 
subunit species may be the contributions of each subunit 
15 to interaction with other co-factors involved in protein 
folding processes. We previously showed that CCT binds 
Hsp70 species [14] . In E. coli, DnaK (Hsp70 family) and 
GroE (Hsp60 family) are thought to cooperate with DnaJ in 
the folding of newly synthesized proteins [38,3 9] . It is 
20 possible that a similar mechanism involving Hsp70, DnaJ 

homologues [40] and CCT occurs in eukaryotic cytosol . We 
found a weak homology between the COOH terminus of CCT6 
and Hsp70 family proteins (not shown) and this may 
suggest a role for CCT6 in interactions between 
25 chaperones and co-factors. 

It is known that some chaperonins (GroE, Hsp60) are 
not able to fold substrates which can be folded by CCT; 
mitochondrial Hsp60 does not fold or form a binary 
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complex with denatured of-actin [15] and GroEL/ES does not 
fold luciferase although it can bind it [17] . This 
suggests that the substrate specificity of classical 
chaperonins and CCT differs and that CCT might have 
5 changed its spectrum of substrate building by evolving a 
far larger number of subunit species than the single 
subunit species chaperonins. Sternlicht et al , [18] 
reported that a 40 IcDa newly synthesized protein co- 
purified with CCT in addition to act in and tubulin. This 
10 would seem to be a novel substrate of CCT. The general 
function of CCT, in addition to its folding of actin and 
tubulin, is illustrated by a genetic experiment in S. 
cerevisiae. Expression of human HTR3 cDNA which encodes 
a partial TCP-1 related polypeptide rescued a mutation in 
15 amino-acid transport [41] , The HTR3 polypeptide has 96% 
homology to mouse CCTf, and is thus the human orthologue 
of CCTf. This suggests that CCT might assist folding of 
a protein affecting amino-acid transport. CCT binds in 
vitro to chromaffin granule membranes coupled to an 
20 affinity matrix [42] . This suggests a role for CCT for. 
the regulation of secretory vesicles and membrane 
trafficking events, A short peptide seqiience of the TCP- 
1- related chaperonin of oats [42] which seems to have 
evolved the specialized function of folding phytochrome, 
25 is more related to CCT6 (Residue nos . 369-384 in Figure 
4) than the other CCT subunits. Specialized functions 
might have been derived from different primordial Cct 
genes in different ancestral organisms. 
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CCT is believed to act mainly as folding machinery 
for tubulin and actin by many investigators despite the 
fact that these proteins show no significant amino-acid 
sequence homology. Actin and tubulin are extremely 
5 abundant proteins in the cell, and thus they were 
probably the easiest proteins to be detected as 
substrates [18] . 

Table 4 summarises proteins known to be folded by 
CCT. Firefly lucif erase has been reported to be folded 
10 by CCT in vitro (17, 61) and brain CCT was purified as- a 
neurofilament fragment binding protein [19] . Lingappa et 
al. [62] reported that TCP-l and/or a TCP-l-like 
chaperonin is involved in capsid assembly by hepatitis 
virus in vitro. Thus, CCT may assist in the folding of 
15 not only actin and tubulin but also a more wider range of 
proteins in vivo. 

Functional analysis of each subunit species will 
provide further information of the role of CCT in vivo in 
general protein folding events in eukaryotic cells. 
20 We have identified nine subunit species of CCT in 

mammalian testis and seven major species in mouse F9 
cells and we cloned seven Tcp-I related genes encoding 
CCT subunits found in common to testis and F9 cell CCT. 
Tcp'l/Cctot and these genes are so highly conserved from 
25 mammals to yeast they can be detected by cross-species 

Southern blotting analysis. The amino-acid sequences of 
the mouse and yeast orthologues of two CCT subunit 
species (CCTa and CCTS) show 61% and 66% sequence 
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identity, respectively. All CCT subunits contain a motif 
similar to an ATP binding domain of cAMP- dependent 
protein kinase and other kinases of this family, 
suggesting all CCT subunits have independent ATPase 

5 activities. The mouse CCT subunits show only 26%-35% 

homology between one subunit protein species and another 
and phylogenetic analysis suggests that the divergence 
time of CCT subunit genes is approximately two times as 
long as the divergence time of animals and yeasts, 

0 These observations suggest that each subunit has 

evolved a specific, independent function in addition to 
the common ATPase function a long time ago and that these 
functions are maintained in all eukaryotes . The 
expansion of the number of subunit species compared to 

5 other chaperonins may have allowed more complex functions 
required for folding and assembly of highly evolved 
eukaryotic proteins. 
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Table 1. Characteristics of CCT subunits. 



Subunit 


Spot on 2D 


Wiimh^T" r^'f 


noxecuiar 


species 


gels 


amino acids 


weight 


CCTa 


S3 


556 


60513 


CCT/3 


S4 


536 


57456 


CCTy 


S5 


545 


60636 


CCT5 


S9 


539 


58073 


CCTe 


S2 


541 


59631 


CCTf 


S7 


531 


58011 


CCTry 


S8 


541 


59658 


CCT 5 


SI 


548 


59562 



The spot numbers of CCT subunits on 2D gels (S1-S9) are 
described in Figure 1. The correspondences of CCT 
subunit ot to spot S3 [14] , e to S2 , T to S7 and 6 to SI 
(Figure 10) were determined by Western blotting of 2D 
gels with antibodies which react specifically with each 
subunit. The other correspondences are derived from the 
peptide sequencing data described in Figure 3 and the 
Western blotting data in Figure 10. Numbers of amino 
acids and molecular weights of CCT subunits are 
calculated from the amino-acid sequences in Figure 3 and 
Figure 8 . 
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Table 2. Homology among mouse CCT subunits and 
archaebacterial chaperone TF55. 





CCTa 


CCTIS 


CCTy 


CCT6 


CCT6 


CCTf 


CCTt? 


1 cere 


TF55 


CCTa 
CCT^ 
CCTy 
CCT6 
CCTe 
CCTf 
CCTt? 

ccxe 

TF55 


35.4 
32.5 
32.1 
31.9 
27.1 
34.6 
27.5 
38 , 7 


7(24) 

28 .4 
28.2 
31.9 
25.0 
31.9 
25.4 
32.0 


3 (13) 
4(8) 

33.1 
32.2 
29.2 
34.6 
27.4 
34.1 


4 (12) 
4 (10) 
5(10) 

36.2 
28,5 
28.5 
27.8 
37.1 


3 (12) 
5(9) 
2(2) 
4(9) 

31.0 
28.4 
29.1 
38.0 


S (24) 
5(14) 
5(16) 
5(10) 
5(17) 

26.3 
22.5 
35.1 


5(21) 

4(8) 

4(8) 

3(4) 

4(6) 

4(8) 

24. 0 
34.9 


5(19) 
6(8) 
5(11) 
5 (11) 
4(6) 

3 (12) 

4 (8) 

5 (10) 


4 (16) 

6(8) 

2(2) 

5(10) 

3(6) 

4(13) 

3(6) 

29.0 



10 



15 



Percentages of identical amino acids are shown below the 
diagonal and numbers of gaps /insert ions are above the 
diagonal. Numbers in parentheses are total lengths of 
gaps /insert ions given in amino acid numbers. 



20 



25 



30 



35 



Table 3. Immunochemical analysis of CCT subuniti 



Antibody 
used for 
detection 


Sec[uence of 

peptide 

inununogen 


Corresponding 
subunit of 
mouse CCT 


Species 
recognised on 
2D gel 


THC-2 
EC-1 
91a* 
BC-1 
GC-1 
2C-1 
TC-1 
DC-1 
UM-1 


SGKKDWDDDQND 
IDDIRKPGESEE 

APRKRVPDHHPC 
NRQTGAPDAGQE 
EIMRAGMSSLKG 
SAGRGRGQARFH 
SILKIDDWNTR 
QDDEVGDGTTSW 


CCTd 

CCTe 

CCTa 

CCT/? 

CCTy 

CCTf 

CCTt; 

CCT5 
chaperonin 
consensus 

motif 


SI 
S2 
S3 
S4 
S5 
S6,S7 
S8 
S9 

S1,S2,S3,S4,S5, 
S6,S7,S8,S9,p63** 



oJ^Jc?r°ur^ monoclonal antibody which recognises the C- terminus 

p63** denotes a co-purifying 63kDa protein of pi 6.93. 
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Table 4 Proteins known to be folded or assembled by CCT 
and co-factors. 





Protein 


Experiment 


Reference 


5 


actin 


in vitro, in 


vivo 


15,18 




actin-RVP 


in vitro 




57 




tubulin 


in vitro, in 


vivo 


16,17,18,57,58,59,60 




neurofilament (fragment) 


in vitro 




19 




lucif erase 


in vitro 




17,61 


10 


hepatitis B virus capsid 


in vitro* 




62 



* TCP-1 or TCP-l-like chaperonin recognized by einti-TCP-l antibody. 
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CLAIMS 



1. A DNA isolate comprising a nucleotide sequence 
encoding a subunit of a polypeptide folding complex, the 
5 subunit having an amino acid sequence shown in any of 
SEQ. ID NO.'s 25, 27, 28, 29, 30 and 31 or an amino acid 
sequence having a significant degree of homology to a 
shown sequence . 

0 2. A DNA isolate according to claim 1 wherein the 
nucleotide sequence is any of SEQ. ID NO.'s 55, 57, 58, 
59, 60 and 61. 



3. A DNA isolate according to claim 1 wherein the 
15 nucleotide sequence is an allele, a mutant or 

derivative, by way of nucleotide deletion, substitution 
or insertion, of any of the nucleotide sequences shown 
in SEQ. ID NO.'s 55, 57, 58, 59, 60 and 61. 

20 4. A DNA isolate comprising a nucleotide sequence 

encoding a subunit of a polypeptide folding complex, the 
subunit comprising an amino acid sequence which is a 
mutant or derivative, by way of amino acid deletion, 
substitution or insertion-, of any of the amino acid 

25 sequences shown in SEQ. ID NO.'s 25, 27, 28, 29, 30 and 
31. 



A vector comprising DNA according to any one of 
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claims 1 to 4 and nucleic acid for expression of the 
encoded subunit . 

6. A host cell comprising a vector according to claim 
5 5. 

7. A host cell according to claim 6 which is 
prokaryotic. 

10 8. A host cell according to claim 6 which is 
eukaryotic. 

9. A method of making a subunit of a polypeptide 
folding complex comprising^ expression from a vector 

15 according to claim 5 . 

10. A method according to claim 9 wherein the 
expression results from culturing a host cell comprising 
the vector - 

20 

11. A method according to claim 10 wherein the host 
cell is prokaryotic. 

12. A method according to claim 10 wherein the host 
25 cell is eukaryotic - 



13. A method of producing a polypeptide folding 
complex, the method comprising expression of subunits of 
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the complex from encoding nucleic acid therefor and 
causing or allowing assembly of the subunits into the 
complex. 



5 14. A method according to claim 13 wherein the subunits 
comprise any of the amino acid sequences shown in SEQ. 
ID NO.'s 25, 26, 21, 28, 29, 30 and 31 or an amino acid 
sequence having^ a significant degree of homology to a 
shown sequence . 

10 

15. A method according to claim 14 wherein the nucleic 
acid encoding any of the subunits has a nucleotide 
sequence which is an allele, a mutant or derivative, by 
way of nucleotide deletion,* substitution or insertion, 

15 of any of the nucleotide sequences shown in SEQ, ID 
NO.'s 55, 56, 57, 58, 59, 60 and 61, 

16. A method according to claim 13 wherein the amino 
acid sequence of any of the subunits is a mutant or 

20 derivative, by way of amino acid deletion, substitution 
or insertion, of any of the amino acid sequences shown 
in SEQ. ID NO.'s 25, 26, 27, 28, 29, 30 and 31. 

17. A method according to any one of claims 13 to 16 
25 wherein assembly of the subunits into the complex is by 

in vitro mixing following expression. 



18. A method according to any one of claims 13 to 16 
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wherein the subunits are expressed together. 

19. A purified polypeptide subunit of a polypeptide 
folding complex comprising an amino acid sequence shown 
5 in any of SEQ. ID NO.'s 25, 27, 28, 29, 30 and 31, or a: 
amino acid sequence having a significant degree of 
homolocfy to a shown sequence . 



20. A polypeptide subunit of a polypeptide folding 
10 complex comprising an amino acid sequence which is a 

mutant or derivative, by way of amino acid deletion, 
substitution or insertion, of any of the amino acid 
sequences shown in SEQ. ID NO.'s 25, 27, 28, 29, 30 and 
31. 

15 

21. A polypeptide folding complex comprising 
polypeptide subunits selected from subunits according to 
claim 19 or claim 20, optionally a subunit comprising 
the amino acid sequence which is shown in SEQ. ID NO. 26 

20 or an amino acid sequence which is a mutant or 

derivative, by way of amino acid deletion, substitution 
or insertion, of the amino acid sequence shown in SEQ. 
ID NO. 26, or an amino acid sequence having a 
significant degree of homology to the sequence shown in 

25 SEQ. ID NO. 26, and optionally a subunit comprising the 
amino acid sequence which is shown in SEQ. ID NO. 24 or 
an amino acid sequence which is a mutant or derivative, 
by way of amino acid deletion, substitution or 
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insertion, of the amino acid sequence shown in SEQ. ID 
NO. 24, or an amino acid sequence having a significant 
degree of homology to the sequence shown in SEQ. ID NO. 
24 . 

5 

22, Use of a polypeptide folding complex according to 
claim 21 in the folding of a polypeptide. 

23 . A method of folding a polypeptide employing a 
10 polypeptide folding complex according to claim 21. 

24. A method according to claim 23 wherein the folding 
of the polypeptide follows assembly of the polypeptide 
folding complex from subunits thereof, following 

15 production of the subunits by recombinant expression. 

25. An antibody or fragment thereof able to bind 
specifically to a subunit of a polypeptide folding 
complex other than TCP-l (CCTa - SEQ. ID NO. 24) . 

20 

26. An antibody or fragment thereof according to claim 
25 wherein the subunit to which the antibody or fragment 
thereof is able to bind is any of CCTj3 (SEQ. ID NO. 25), 
CCTy (SEQ. ID NO. 26) , CCT6 (SEQ. ID NO. 27), CCTe (SEQ. 

25 ID NO. 28), CCTr (SEQ. ID NO. 29), CCTt? (SEQ. ID NO. 30) 
and cere (SEQ. ID NO. 31) . 



27. An antibody or fragment thereof according to claim 
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26 having binding specificity for a peptide which has an 
amino acid sequence shown in any of SEQ. ID NO.' s 1, 2, 
3, 4, 5, 6 and 7, or which has an amino acid sequence 
which is a variant of any of those shown in SEQ. ID 
5 NO. 'si, 2, 3, 4, 5, 6 and 7 , 



28. A method of obtaining an antibody or fragment 
thereof according to claim 25 comprising administering a 
peptide which has an amino acid sequence shown in any of 
10 SEQ, IDNO^'s 1, 2, 3, 4, 5, 6 and 7, or which has an 
amino acid sequence which is a variant of any of those 
shown in SEQ. IDNO.'s 1, 2, 3, 4, 5, 6 and 7, to a 
mammal and recovering antibody from the mammal . 
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Fig. 1b. 
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Fig. 3a. 

MASLSLAPVNIFKAGADEERAEIARLSSFIGAIAIGDLVKSTLGPKGMDK 0 50 

I L L S S GRD AALM VTNDGAT I LKNIGVDNPAAKVLVDMSR VQDDEVGDGTT 10 0 

P1(T31) 

SVTVLAAE LLREAESLIAKKIHPQTIISGWREATKAAREALLSSAVDHGS 15 0 

DEARFWQDLMNIAGTTIiSSKLLTHHKDHFTKLAVEAGLRLKGSGNLEAIH 200 

VIKKLGGSLADSYLDEGFLLDKKIGVNQPKRIENA KILIAN TGMDTDKIK 2 5 0 

B4 (R162) 

IFGSRVRVDSTAKVAEIEHAEKEKMKEKVERIIiKHGINCFINRQIiIYNYP 30 0 

EQLFGAAGVMAIEHADFAGVERLALVTGGEIASTFDHPELVKLGSCKLIE 3 5 0 

EVMIGEDKLIHFSGVALGEACTIVLRGATQQILDEAERSLHDALCVLAQT 40 0 

VKDPRTVYGGGCSEMLMAHAVTQIiANRTPGKEAVAMESFAKALRMIiPTII 4 5 0 

ADNAGYDSADLVAQLRAAHSEGHITAGLDMKEGTIGDMAVLGITESFQVK 50 0 
RQVLLSAAEAAEVILRVDNI IKAAPRKRVPDHHPC 5 3 5 



Fig. 3b. 

MMGHRPVIiVLSQNTKRESGRKVQSGNINAAKTIADIIRTCLGPKSMMKML 05 0 

LDPMGGIVMTNDGNAILREIQVQHPAAKSMIEISRTQDEEVGDGTTSVII 10 0 

LAGEMLSVAEHFLEQQMHPTWISAYRMALDDMISTLKKISTPVDVNNRE 15 0 

MMLSIINSSITTKVISRWSSI-ACNIALDAVKTVQFEENGRKEIDIKKYAR 20 0 

VEKIPGGIIEDSCVLRGVMINKDVTHPRMRRYIKNPRIVLLDSSLEYKKG 2 5 0 

ESQTDIEITREEDFTRI LQMEE EYIHQLCEDIIQLKPDWITEKGISDIiA 300 

^ XX"^ P5{T36) 

QHYLMRANVTAIRRVRKTDNNRIARACGARIVSRPEEI-REDDVGTGAGLL 3 5 0 

EIKKIGDEYFTFITDCKDPKACTILLRGASKEILSEVERNLQDAMQVCRN 40 0 

VHiDPQLVPGGGASEMAVAHALTEK S KAMTGVE QWPYRAVAQALEVIPRT 4 5 0 

P5{T32) B1(R133) 

LIQNCGASTIRLLTSLRAKHTQESCETWGVNGETGTIiVDMKELGIWEPIiA 5 0 0 

VKLQTYKTAVETAVLIiLRIDDIVSGHKKKGDDQNRQTGAPDAGQE 54 5 



SUBSTITUTE SHEET (RULE 26) 
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Fig. 3c. 



MPENVASRSGAPTAGPGSRGKSAYQDRDKPAQIRFSNXSAAKAVADAIRT 

SLGPKGMDKMIQDGKGDVTITNDGATILKQMQVLHPAARMLVELSKAQDI 

EAGDGTT S VVII AGSLLDSCTKLLQKGIHPTIISESFQKALEKGLEILTD 
X P3(T31) 

MSRPVQLSDRETLLNSATTSLNSKWSQYSSLLSPMSVNAVMKVIDPATA 

TSVDLRDIKIVKKLGGTIDDCELVEGLVLTQKVANSGITRVEKAKIGLIQ 

FCIiSAPKTDMDNQIVVSDYAQMDRVIiREERAYIIiNLVKQIKKTGCNVLLI 

QKSILRDALSDLAIiHFLNKMKIMVVKDVEREDIEFICKTIGTKPVAHIDQ 

FTADMLGSAELAEEVSLNGSGKLFKITGCTSPGKTVTIWRGSNKLVIEE 

AERSXHDALCVIRCIiVKKRALXAGGGAPEXEIiAIiRIiTEYSRTLSGMESYC 

VR AFADAMEVXPSTLAENAGLNPXSTVT ELRNRHAQGEKTTGXNVRKGGX 

P3(T50) X 
SNXLEEMWQPLIiVSVSAIiTLATETVRSXLKIDDWNTR 53 9 



MASVGTIiAFDEYGRPFLIXKDQDRKSRLMGLEALKSHXMAAKAVANTMRT 
SliGPNGLDKMMVDKDGDVTXTNDGATILSMMDVDHQXAKLMVELSKSQDD 
EXGDGTTGWVLAGALIiEEAEQLLDRGXHPXRXADGYEQAARIAXQHIiDK 
ISDKVLVDXNNPEPLXQTAKTTIiGSKVXNSCHRQMAEXAVNAVLTVADME 
RRDVDFELXKVEGKVGGRLEDTKLXKGVXVDKDFSHPQMPKKVVDAKXAX 
LTCPFEPPKPKTKHKLDVMSVEDYKALQKYEKEKFEEMXKQXKETGANLA 
XCQWGFDDEANHLLLQNGLPAVRWVGGPEXELXAXATGGRXVPRFSELTS 
EKLGFAGWQEXSFGTTKDKMLVXEKCKNSRAVTXFXRGGNKMXIEEAKR 
SLHDALCVXRNLXRDNRWYGGGAAEXSCALAVSQEADKCPTLEQYAMRA 
FADAIiEVXPMALSENSGMNPXQTMTEVRARQVKESNPALGXDCLHKGSND 
MQYQHVXETLXGKKQQXSLATQMVRMXLKXDDXRKPGESEE 541 
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Fig.Se. 

MAAVKTLNPKAEVARAQAALAVNI S AARGLQDVLRTNLGPKGTMKMLVSG 0 5 0 

AGDIKLTKDGNVLLHEMQIQHPTASLIAKVATAQDDITGDGTTSNVLIIG 10 0 

ELLKQADLYISEGLHPRIITEGFEAAKEKALQFLEQVKVSKEMDRETI-ID 15 0 

VARTSLRTKVHAELADVLTEAVVDSILAIRKKDEPIDLFMVEIMEMKHKS 2 00 

ETDTSLIRGLVIiDHGARHPDMKKRVENAYILTCNVSLEYEKTEVNSGFFY 2 50 

KSAEEREKIiVKAE RKFIEDR VKKI lELKKKVCGDSDKGFWINQKGlDPP 3 0 0 

B3(R170) 

SI-DAL AKEG I VALRRAKRRNMERLTLACGG I ALNS FDDLNPDC LGHAGLV 3 5 0 

YEYTIiGEEKFTFIEKCNNPRSVTLLVKGPNKHTLTQIKDAIRDGIiRAVKN 4 0 0 

AIDDGCVVPGAGAVEVAIiAEALIKYKPSVKGRAQLGVQAFADALLI IPKV 4 5 0 

LAQNSGFDLQETLVKVQAEHSESGQLVGVDLSTGEPMVAAEMGVWDNYCV 500 
KKOLIiHSCTVIATNILLVDEIMRAGMSSLKG 531 



Fig.Sf. 

MMPTPVILLKEGTDSSQGIPQLVSNISACQVIAEAVRTTLGPRGMDKLIV 050 

DGRGKAT I SNDGATILKLLDWHPAAKTLVDIAK SQDAEVGDGTTSVTLL 10 0 

P2 

AAEFIi KQVKPYVEE GLHPQIIIRAF RTATQLAV NKIKEIAVTVKKQDKVE 15 0 

^ 52 (R115 ) B2 (R124 ) 

QRKMLEKCAMTALSSKLISQQKVFFAKMWDAVMMLDELIiQLKMIGIKKV 2 00 

QGGALEESQLVAGVAFKKTFSYAGFEMQPKKYKNPKIAIiLNVELELKAEK 2 50 

DNAEIRVHTVEDYQAIVDAEWNILYDKLEKIHQSGAKVILSKLPIGDVAT 3 0 0 

QYFADRDMFCAGRVPEEDLKRTMMACGGSIQTSVNALVPDVLGHCQVFEE 3 5 0 

TQIGGERYNFFTGCPKAKTCTIILRGGAEQFMEETERSLHDAIMIVRRAI 4 00 

KNDSWAGGGAIEMELSKYIiRDYSRTIPGKQQLLIGAYAKALEIIPRQLC 45 0 

DNAGFDATNILNKLRARHAQGGMWYGVDINNENIADNFQAFVWEPAMVRI 5 0 0 

NALTAASEAACLIVSVDETIKNPRSTVDPPAPSAGRGRGQARFH 54 4 
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Fin. 5. ^^T" KEVGDGTTSWI lA 

^* * CCTP DEVGDGTTSVTVLA 

CCTy EEVGDGTTSVIILA 

CCT6 lEAGDGTTSWI lA 

CCTe deigdgttgwvia 

CCTC DITGDGTTSNVLII 

CCTt] AEVGDGTTSVTLIA 

CCTe QEVGDGTNFVLVFA 



cAPK-a KTLGTGSFGRVMLV 

PKC-a MVLGKGSFGKVMLA 

CaMII-a EELGKGAFSWRRC 

SNFl KTLGEGSFGKVKLA 

Cdc2+ EKIGEGTYGWYKA 

CDC 7 DKIGEGTFSSVYKA 

Raf TRIGSGSFGTVYKG 

Src VKLGQGCFGEVWMG 

Abl HKLGGGQYGEVYEG 

EGFR KVLGSGAFGTVYKG 

INSR RELGQGSFGMVYEG 

PDGFR RTLGSGAFGQWEA 
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Fig. 6a. 
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Fig. 8a. 

(Ccta) 

Tcp'l cDNA sequence 



TCTCAGAGCGGAGCTATCGGTTGTCTGCGGCCGTAGTGAGCTTTGCTTCGTTTCCTGAAGATGGAGGGTCC^^ 

MEGPLSVPGD 

CGCAGCACTGGGGAGGCGGTCCGCTCCCJlGAArGTTATGGCTGCAGCTrCTATTGCCAACATTGTT;^^ 

RSTGEAVRSQNVMAAASIANIVKSSFGPVG 
TTGGATAAAATGTTGGTGGATGATATTGGTGATGTAACCATTACTAACGATGGTGCCACCATCCTGA^ 

LDKMLVDDIGDVT ITNDGATIIiKLLEVEHP 
GCAGCCAAAGTTCTGTGTGAGCTGGCTGACCTGCAAGACAAAGAAGTTGGAGATGGAACTACCT 

AAKVLCELADLQDKEVGDGTT SVVI lA AEL 
CTGAAAAATGCAGATGAGCTAGTCAAACAGAAAATTCATCCAACATCAGTTATTAGTGGCTATCGTC^ 

LKNADELVKQKIHPTSVISGYRLACKEAVR 
TATATCAATGAGAACCTGATTATTA^CACAGACGA^CTTGGAAGAGACTGTCTGATCAATACTGCTAA^ 

Y INENLIINTDELG RDCLINTAKTSMSSKI 
ATTGGAATAAATGGTGATTACTTTGCTAATATGGTAGTAGATGCTGTGCTTGCTGTTAAATA 

IGINGDYFANMVVDAVLAVKYTDARGQPRY 
CCAGTCAATTCTGTTAATATTCTGAAAGCCCATGGGAGAAGTCAGATAGAAAGCATGCl^ 

PVNSVNILKAHGRSQIESMLXNGYALNCVV 
GGATCTCAGGGCATGCCaVAGAGAATAGTTAATGCAAAAATTGCTTGTCTTGACTTCATC 

GSQGMPKRIVNAKIACLDFSLQKTKMKLGV 
CAGGTGGTTATTACyVGACCCTGAGAAATTGGACCAAATTAGAa^GAGAGAATaXSATATCACC^ 

QVVITDPEKLDQIRQRESDITKERIQKILA 
ACTGGTGCCAATGTTATTCTAACCACTGGTGGCATTGATGATATGTATCTCAAGTATTTTGTGGAAG^ 

TGANVILTTGGIDDMYLKYFVEAGAMAVRR 
GTTTTAAAACGAGACCTGAAGCATGTTGCAAAAGCTTCTGGAGCAAGTATCCTGTCT^ 

VLKRDLKHVAKASGAS ILSTLANLE-GEETF 
GAAGTGACGATGTTGGGACAAGCGGA^GAGGTCGTACAGGAGAGAATTTGTGATGATGAGCTGATCTTAAr^^ 

EVTMLGQAEEVVQERICDDELILIKNTKAR 
ACATCTGCTTCAATCATCTTACGAGGAGCAAATGATTTaVTGTGTGATGAAATGGA<^ 

T SAS IILRGAKDFMCDEMERS LHDALCVVK 
AGAGTTTTGGAGTTGAAATCTGTGGTCCCAGGTGGAGGTGCTGTAGAAGCTGCCCT 

RVLELKSVVPGGGAVEAALSIYLEKYATNM 
GGATCTCGGGAACAGCTTGCTATTGCAGAGTTTGCAAGATCrCTGCTTGT 

GSREQLAIAEFARSLLVIPNTLAVNAAQDS 
ACXrGACCTGGTTGCCAAGTTAAGAGCTTTTCAO^TGAGGCTCAA^ 

TDLVA KLRAFHNEAQVNPERKNLKWIGLDL 
GTCCATGGGAAACCACGAGACAACAAGOUIGCAGGGGTGTTTGAA 

VHGKPRDNKQAGVFEPTIVKVKSLKFATEA 
GOUVTCACCATCCTTCGGATTGACGATCTGATAAAATTACACCCAGAAAGO^ 

A ITILRIDDLIKLHPESKDDKHGSYENAVH 

TCTGGAGCCCTTGATGACTGATTCGATTTCCCTCTTATTTATAACAGTGTCAGGTG^ 
S G A L O D « 

ACAGCAAGCTGT po ly ( A ) * 
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Fig.8b. 

(Cctb) 

pTbeta2 cDNA Sequence 

GGCTTCCCGGCGCGGATTGTGAGGGGTTCTCTCCTCTACGCGGAATCCGTCGGAACCATGGCTTCCCTTTCCCTCGCACCTGTTAATATC 

MASLSLAPVNI 

TTCAAGGCTGGAGCTGATGAAGAGAGGGCCGAGATAGCTCGCCTGTCGTCGTTTATCGGTGCCATCGCCATTGGAGACTTGGTGAAGAGC 
FKAGADEERAEIARLSSFIGAIAIGDLVKS 

ACTTTGGGACCGAAGGGCATGGACAAAATTCTCCTAAGCAGTGGACGAGACGCCGCTCTGATGGTGACCAACGACGGTGCTACCATTCTC 
TLGPKGMDKILLSSGRDAALMVTNDGATIL 

AAGAACATTGGTGTGGACAACCCCGCAGCAAAGGTTCTAGTTGATATGTCAAGGGTTCAAGATGATGAAGTTGGTGATGGCACTACCTCT 
KNIGVDNPAAKVLVDMSRVQDDEVGDGTTS 

GTTACTGTCTTAGCCGCAGAGCTGCTCCGGGAAGCAGAATCTTTAATTGCAAAAAAGATACATCCACAGACCATCATCTCAGGTTGGAGA 
VTVLAAELLREAESLIAKK.IHPQTIISGWR 

GAAGCCACAAAGGCAGCAAGAGAGGCCCTGCTGAGCTCCGCTGTGGATCATGGTTCTGATGAAGCCAGATTCTGGCAGGACTTAATGAAC 
EATKAAREALLSSAVDHGSDEARFWQDLMN 

ATTGCAGGAACGACATTGTCCTCAAAGCTCCTTACTCACCACAAGGACaVCTTTACTAAACTGGCCGTGGAAGCGGG 
lAGTTLSSKLLTHHKDHFTKLAVEAGLRLK 

GGCTCTGGCAACCTGGAGGCGATTCATGTCATCAAGAAACTAGGTGGGAGTCTGGCAGACTCCTATCTAGATGAAGGTTTTCTTTTGGAT 
GSGNLEAIHVIKKLGGSLADSYLDEGFLLD 

AAAAAAATTGGAGTAAATCAACCAAAGAGAATTGAAAATGCTAAAATTCTTATTGCAAATACTGGGATGGATACAGACAAAATAAAGATA 
KKIGVNQPKRIE NAKILIANTGMDTDKIKI 

TTTGGCTCTCGGGTAA6A6TTGATTCCACAGCAAAGGTTGCAGAGATAGAACATGCAGAAAAGGAGAAGATGAAGGAGAAAGTTGAACGT 
FGSRVRVDSTAKVAEIEHAEKEKMKEK,VER 

ATTCTTAAGCATGGAATAAATTGCTTTATTAACAGACAGTTAATTTATAATTACCCTGAACAACTCTTCGGCGCTGCTGGCGTCATGGCT 
ILKHGINCFINRQLIYNYPEQLFGAAGVMA 

ATTGAGCATGCGGATTTCGCAGGTGTGGAGCGCCTCGCTCTTGTCACAGGTGGTGAGATTGCCTCTACCTTTGATCACCCAGAACTTGTG 
lEHADFAGVERLALVTGGEIASTFDHPELV 

AAGCTTGGAAGTTGCAAACTTATTGAAGAAGTTATGATCGGGGAAGATAAACTCATTO^CTTTTCTGGGGTTGCCCTTGGTGAGGCATGC 
KLGSCKLIEEVMIGEDKLIHFSGVALGEAC 

ACCATTGTGCTTCGTGGTGCCACTCAGCAAATTCTGGATGAAGCTGAACGATCTCTGCATGATGCTCTTTGTGTTCTTGCTCAGACTGTA 
TIVLRGATQQILDEAERSLHDALCVLAQTV 

AAAGATCCTAGAACAGTTTACGGGGGAGGCTGCTCTGAGATGCTGATGGCCCATGCTGTGACACAGCTTGCCAACAGAACCCCAGGAAAA 
KDPRTVYGGGCSEMLMAHAVTQLANRTPGK 

GAAGCTGTAGCAATGGAGTCGTTTGCAAAAGCCCTGAGAATGTTGCCGACCATCA7AGCCGACAATGCGGGCTATGACAGTGCAGATCTG 
EAVAMESFAKALRKLPTI lADNAGYDSADL 

GTGGCACAGCTCCGAGCTGCTCACAGTGAAGGCCATATAACTGCTGGACTGGATATGAAGGAAGGTACCATCGGCGATATGGCAGTACTG 
VAQLRAAHSEGHITAGLDMKEGTIGOMAVL 

GGTATAACAGAGAGTTTTCAAGTGAAGCGACAGGTTCTTCTGAGTGCGGCTGAAGCAGCAGAGGTGATCCTGCGAGTGGACAACATTATC 
G I TE S FQVKRQVLLSAAE AA EV I L RVDK I I 

AAAGCAGCACCAAGGAAACGTGTCCCCGATCACCACCCCTGTTAAGCATTCCCATTTGCTGATGAACTCTGGGCCAGTTCATAGCAAAGT 
KAAPRKRVPDHHPC . 

TGTACTTGGAAGACTTCAACCTTTAAAGAAGACTGGTGGAATTGACCTGTCCATGATAGCCTTAAGTTTGAACATTAGCTGACCTTCTGT 



GTTAAACATGGGTCTAATTTATTTACTGTTTCATTTTCCATACAATTCAGTTGATTTACAAGTTCATTTCTCATACTGTGTATTAAAATA 



AAAATCCAGTTACTTAGCCCTTAAAAAAAAAAAA 
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Fig.Sc. 

(Cctg) 

pTgamma? cDNA Sequence 



CGTCGTGCCGCCATGATGGGCCACCGTCCAGTGCTCGTGCTCAGTCAGAACACAAAGCGTGAATCTGGAAGAAAAGTTCAGTCTGGAAAT 
MMGHRPVLVLSQKTKRESGRKVQSGN 

ATCAATGCTGCAAAGACAATTGCAGACATCATCCGGACCTGCTTGGGACCTAAATCTATGATGAAGATGCTTTTGGACCCAATGGGAGGC 
I K A A KTIADIIRTCLGPKSMMKMLLDPMGG 

ATCGTGATGACCAATGATGGCAATGCCATTCTTCGAGAGATTCAAGTCCAGCATCCCGCAGCAAAGTCCATGATTGAAATTAGCAGGACC 
IVMTNDGNAILREIQVQHPAAKSMIEISRT 

CAGGATGAAGAGGTTGGAGATGGGACCACATCAGTAATTATTCTTGCGGGAGAAATGCTCTCTGTGGCTGAACACTTCCTAGAGCAGCAG 
QDEEVGDGTTSVIILAGEMLSVAEHFLEQQ 

ATGCACCCAACAGTGGTGATCAGTGCTTACCGCATGGCACTGGATGATATGATCAGCACTCTGAAGAAAATCAGTACTCCTGTTGATGTC 
MHPTVVISAYRMALDDMISTLKKISTPVDV 

AATAACCGTGAGATGATGTTGAGCATCATCAATAGCTCTATTACTACAAAAGT»TCAGTCGGTGGTCCTCCTTGGCATGCAACAW 
NNREMMLSIINSSITTKVISRWSSLACNIA 

CTGGATGCTGTTAAGACTGTGCAGTTTGAAGAGAATGGCAGAAAGGAAATTGACATCAAAAAATATGCAAGGGTAGAAAAGATACCCGGG 
LDAVKTVQFEENGRKEIDIKKYARVEKIPG 

GGCATCATTGAAGACTCATGTGTCTTACGTGGAGTTATGATTAACAAGGATGTGACCCATCCAAGGATGCGCCGCTATATTAAGAATCCT 
GIIEDSCVLRGVMINKDVTHPRMRRYI KNP 

CGAATTGTGCTATTGGATTCTTCTTTGGAGTACAAGAAAGGAGAAAGCCAGACCGACATCGAGATTACCCGGGAGGAGGACTTCACGCGG 
RIVLLDSSLEYKKGESQTDIEITREEDFTR 

ATCCTGCAGATGGAGGAGGAGTACATCCATCAGCTGTGTGAGGACATCATCCAGCTGAAGCCTGACGTGGTCATCACAGAGAAGGGtt^ 
ILQMEEEYIHQLCEDIIQLKPDVVITEKGI 

TCAGATTTAGCTCAGCACTACCTCATGCGGGCCAATGTCACAGCCATTCGTAGAGTCCGGAAAACAGACAATAATCGCATTGCTAGAGCC 
SDLAQHYLMRANVTAIRRVRKTDNNR lARA 

TGTGGGGCACGGATAGTCAGCCGACCTGAGGAACTGAGAGAAGATGATGTTGGTACAGGGGCAGGCTTATTGGAAATCAAGAAGAT7GGA 
CGARIVSRPEELREDDVGTGAGLLEXKKZG 

GATGAGTACTTTACATTCATCACTGACTGCAAAGACCCAAAGGCCTGCUVCCATTCTTCTTAGAGGAGCCAGCAA^ 
DEYFTF ITDCKDPKACTILLRGASKEILSE 

GTAGAACGCAACCTCCAGGATGCCATGCAAGTGTGCCGCAATGTTCTACTGGACCCTCAGCTGGTGCCTGGTGGTGGAGCCTCTG 
VERNLQDAMQVCRNVLLDPQLVPGGG ASEM 

GCTGTGGCTCATGCCTTGACAGAAAAATCTAAGGCCATGACTGGTGTGGAACAATGGCCATATAGAGCTGTGGCCCAGGCGTTAGAGGTC 
AVAHALTEKSKAMTGVEQWPYRAVAQALEV 

ATCCCTCGGACCTTGATCCAGAATTGTGGGGCCAGTACCATTCGTCTGCTTACCTCCCTTCGGGCCAAGCACACACAGGAGAGTTGTGAG 
IPRTLIQNCGASTIRLLTSLRAKHTQESCE 

ACCTGGGGTGTGAATGGTGAGACTGGTACCTTGGTGGACATGAAAGAGCTGGGTATTTGGGAGCCATTGGCTGTGAAGCTACAAACGTAC 
TWGV NGETGTLVDMKELG IWEP LAVKLQTY 

AAAACAGCAGTAGAGACTGCAGTTCTGCTTCTGCGGATTGATGACATTGTCTCTGGCCACAAGAAGAAGGGTGATGACCAGAACCGGCAA 
KTAVETAVLLLRIDDIV SGHKKKGDDQNRQ 

ACTGGTGCTCCAGATGCTGGCCAGGAGTGAGTGCTGAGCACGGTGACTTCCATGCACAGAACCAGCAGTCTCCCCTTTCCTGAGCCAGAG 
TGAPDAGQE - 

TTCCAGGAACACTGTGGACATCTTTGTTTGCGAAGGATCAGGTTGAGGGGCAGCCCCCAGTCTGTCCCATCTCAGTTTGCAAAAAGCACT 



GACACGTATCTCTTCTATTGTAAGCTTTCCATTTAGTTTGCTTCCAATGATTAAATCTAAGTCATTTGAAAAAAAAAAAAA 
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Fig.Sd. 

(Ccfd) 

pTdelta2 cDNA Sequence 



GGCTCTCCAGCAGGCCCCTTGCTCGGCTTCCGCCTTCTCTCCCGGCGTCGCTTTCTGGAAGGTTCGTGGAGGAGGCGGTCAGGGAGACC 



GTTACTCCACAGCAAGCCGGAATCCGTGTCCATCCGTCCTCCTGAACCCGCGCAGACGCCACCAAGGTCGCCATGCCGGAGAACGTAGCT 

M P £ N V A 

TCCCGAAGCGGGGCGCCCACCGCCGGGCCCGGCAGCCGCGGGAAAAGCGCCTACCAGGACCGCGACAAGCCAGCCCAGATCCGCTTCAGC 
SRSGAPTAGPGSRGKSAYQDRDKPAQIRFS 

AATATTTCCGCGGCCAAAGCGGTTGCTGATGCTATTAGAACAAGCCTTGGACCTAAAGGAATGGACAAAATGATTCAAGATGGAAAAGGC 
NISAAKAVADAIRTSLGPKGMDKMIQDGKG 

GATGTGACCATTACAAATGATGGTGCCACCATTCTGAAACAAATGCAGGTATTGCATCCAGCAGCCAGAATGCTGGTGGAATTGTCTAAA 
DVTITNDGATILKQM QVLHPAARMLVELSK 

GCTCAAGACATAGAAGCAGGAGATGGCACCACGTCGGTTGTCATCATTGCTGGCTCTCTTTTAGACTCCTGTACCAAACTTCTGCAGAAA 
AQDIEAGDGTTSVVIIAGSLLDSCTKLLQK 

GGTATACATCCAACCATCATTTCCGAGTCATTCCAGAAAGCTTTGGAAAAGGGTCTTGAAATCCTTACTGACATGTCTCGACCTGTGCAA 
GIHPTI ISESFQKALEKGLEILTDMSRPVQ 

CTGAGCGACAGAGAAACTTTGTTAAATAGCGCAACTACTTCATTGAATTCAAAGGTTGTCTCTCAGTATTCAAGTCTACTCTCTCCAA 
LSDRETLLNSATTSLNSKVVSQYSSLLSPM- 

AGTGTCAATGCGGTGATGAAAGTGATTGACCCAGCCACAGCTACCAGTGTAGATCTTCGAGATATTAAAATAGTTAAGAAGCTTGGTGGG 
SVNAVMKVIDPATATSVDLRDIKIVKKLGG- 

ACAATAGATGACTGTGAGCTGGTGGAAGGCCTCGTTCTCACACAGAAAGTAGCAAATTCTGGCATAACAAGAGTTGAAAAGGCTAAGATT 
TI DDCELVEGLVLTQKVANSGITRVEKAKI 

GGGCTTATTCAGTTTTGCTTATCTGCTCCTAAAACAGATATGGATAATCAAATAGTAGTATCTGACTATGCCaiGATGGATCGAGTGCTT 
GLIQFCLSAPKTDMDNQIVVSDYAQMDRVL 

CGAGAGGAGAGAGCCTATATTTTAAATTTGGTGAAGCAAATTAAGAAAACAGGATGTAATGTCCTTCTCATACAGAAGTCTATCCTGAGA 
R EERAYILNLVKQIKKTGCNVLLIQKSILR 

GATGCCCTTAGTGATCTTGCATTACATTTTCTGAATAAGATGAAGATrTATGGTGGTTAAGGACGTTGAAAGAGAAGACATTGAATTCATC 
DALSDLALHFLNKMKlMVVKDVEREDIEri 

TGTAAGACAATTGGAACCAJUlCCAGTTGCTCACATTGACCAGTTaiCTGCTGAaiTGCTGGGTTCTGCTGAGTTATC 
CKTIGTKPVAHIDQFTADMLGSAELAEEVS 

TTAAATGGTTCTGGAAAACTATTCAAGATTACAGGTTGTACAAGCCCAGGGAAAACAGTTACAATTGTCGTACGTGGTTCTAACAAACTG 
LNGSGKLFKITGCTSPGKTVTIVVRGSKKL 

GTGATTGAAGAAGCTGAGCGCTCCATTCATGATGCTCTCTGTGTCATCCGATGCTTAGTAAAGAAAAGAGCTCTTATTGCAGGAGGTGGT 
VIEEAERS IHDALCVIRCLVKKRALIAGGG 

GCTCCAGAAATAGAGCTGGCCCTCAGACTGACAGAGTACTCCCGAACACTGAGTGGTATGGAGTCCTACTGTGTTCGTGCTTTCGCGGAT 
APEIELALRLTEYSRTLSGMESYCV RAFAD 

GCTATGGAAGTCATTCCATCTACACTAGCTGAAAATGCTGGCCTGAATCCCATTTCTACAGTAACAGAGCTAAGAAACCGCCATGCCCAA 
AMEVIPSTLAENAGLNPISTVTELRNRHAQ 

GGAGAAAAAACTACAGGCATTAATGTCCGAAAGGGTGGGATCTCCAACATTTTGGAGGAAATGGTTGTTCAGCCTCTGTTGGTGTCAGTC 
GEKTTG INVRKGGI SN ILEEMVVQPLLVS V 

AGTGCTTTGACCTTAGCAACTGAAACTGTGCGGAGOITTCTGAAAATCGATGATGTGGTAAATACTCGATAATCTGGATAAAAGGATG^ 
SALTLATE TVRS ILKI DD.VVNTR , 

TGACTGCATCATTATGGACAGAAGTACTGTGGCTGGAATGAAGGACAACCACCTTGTTCCTTGTCTGGAAGCTTCAGAGT^ 



TGTCTTCCAGTTGGCATTTGTTTGAAATTGTATTGAAACAATTTAATGAAAACATTAAATACTTGGTTTTAAACTCCAAAA 



AAAAAAA 
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Fig.Se. 

(Ccte) 

pTepsilonS cDNA Sequence 

GGATTCTGCGTCCTCTCGCGAGAGGGAGGTGCGGTGTGTCTCTCTTCCCGAGTGGTCCCCGCTGGTTACTGGAGGAGCGCTCGTTCTAGT 



TCGTCCACCATGGCGTCCGTGGGGACCCTCGCCTTCGATGAGTATGGGCGCCCCTTTCTCArrATCAAGGACCAAGATCGCAAGTCCCGT 
MASVGTLAFDEyGRPFLI IKD QDRKSR 

CTCATGGGGCTTGAGGCCCTCAAGTCTCACATCATGGCTGCCAAAGCTGTAGCAAACACAATGCGGACGTCACTGGGACCAAACGG 
LMGLEALKSHIMAAKAVANTMRTSLGPKGL 

GACAAGATGATGGTTGATAAGGATGGCGATGTGACTATAACAAACGATGGTGCCACCATTCTAAGCATGATGGATGTTGATCATCAGATT 
DKMMVDKDGDVTITNDGATILSMMDVDHQI 

GCCAAGCTGATGGTTGAACTGTCCAAATCCCAGGATGATGAAATTGGAGATGGGACCACAGGAGTGGTTGTCTTGGCTGGGGCCTTGTTG 
AKLMVELSKSQDDEIGDGTTGVVVLAGALL 

GAAGAAGCTGAACAGCTGCTGGACCGAGGCATTCACCCAATCAGAATTGCTGATGGCTACGAGCAGGCTGCCCGAATTGCAATACAACAC 
EEAEQLLDRGIHPIRIADGYEQAARIAIQH 

CTGGACAAGATCAGCGATAAAGTGCTTGTCGACATAAACAACCCTGAACCTCTGATT 

LDK ISDKVLVOINNPEPLI QTAKTTLGSKV 
ATTAACAGCTGTCACCGACAGATGGCTGAGATCGCCGTGAATGCCGTCCTCACGGTGGCAGATATGG 

INSCHRQMAE lAVNAVLTVADMERRDVDFE 

CTCATTAAAGTGGAAGGCAAAGTAGGTGGGCGTCTGGAAGACACCAAGCTCATAAAGGGTGTGATCGTCGACAAGGACTTCAGCCACCCA 
LIKVEGKVGGRLEDT-KLIKGVIVDKDFSHP 

CAGATGCCGAAAAAAGTGGTAGATGCTAAGATTGCGACT 

QMPKKVVDAKIAILTCPFEPPKPKTKHKLO 
GTCATGTCTGTGGAGGACTACAAAGCCCTGCAGAAGTACGAAAAGGAGAAGTTTG 

VMSVEDYKALQKYEKEKFEEMIKQIKETGA 
AACCTAGCTATTTGCCAGTGGGGCmACGATG^^ 

NLAICQWGFDDEANHLLLQNGLPAVRWVGG 
CCTGAGATTGAGpGATCGCCATTGCAACAGGAGGACGGATTGTCCCACGGTO 

^ ^ lEL lAIATGGRIVPRFSELTSEKLGFAG 
GTGGTGCAGGAGATCTCCTTTGGCACTACAAAAGACAAAATG^^ 

VVQE ISFGTTKDKMLVIEKCKNSRAVTIFI 
AGAGGAGGAAACAAGATGATCATAGAAGAA^ 

RGGNKMIIEEAKRSLH OALCVIRNLIRDNR 
GTTGTGTATGGAGGAGGGGCAGCCGAAATATC 

VVYGGGAAEISCALAVSQEAOKCPTLEQYA 
ATGAGAGCTTTTGCAGATGCCTTGGAGGTCATCCCCATGGCCCnrrTCAGA^ 

MRAFADA.LEVIPMALSENSG MNPIQTMTEV 
CGAGCCAGACAGGTGAAGGAGTCTAACCCTGCCCTGGGGATTGACTGTTTGCACAAGGGCAGT^^ 

RARQVKESNPALGIDCLHKGSNDMQYQHVI 
GAAACCTTGATTGGCAAAAAGCAGCAGM^^ 

ETLIGKKQQISLATQMVRMILKIDDIRKPG 

GAATCTGAAGAATAAACTGTACCATTTACCACTGTGACTAAATAAAGGGTGTGTCTGTTAAAAAAAAAAAAAAAAA^^ 
£ 5 E £ . 
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GAAGACCCCGCAGAGAGCACGTTGTTCTCGGCCTCTCCCGGAGCTAGGCCAGCCATGGCGGCGGTAAAGACCCTAAATCCGAAGGCCGAG 

MAAVKTLNPKAE 

GTGGCCCGGGCCCAGGCAGCGCTGGCGGTGAACATCAGCGCGGCTCGGGGCCTGCAGGATGTTCTGAGGACCAACTTGGGGCCTAAGGGC 
VARAQAALAVNISAARGLQDVLRTKLGPKG 

ACCATGAAGATGCTTGTATCTGGTGCTGGAGACATCAAACTTACTAAAGATGGCAATGTGCTGCTTCATGAAATGCAAATTCAACACCCA 
TMKMLVSGAG DIKLTKDGNVLLHEMQIQH P 

ACAGCCTCTTTGATAGCAAAAGTGGCTACAGCCCAGGATGACATAACTGGCGATGGCACTACATCCAATGTCCTCATCATCGGGGAGCTG 
TASLIAKVATAQDDITGDGTTSNVLIIGEL 

CTCAAACAGGCGGACCTGTACATTTCTGAAGGTCTTCACCCAAGAATAATAACTGAAGGTTTTGAAGCGGCAAAAGAAAAGGCACTCCAA 
LKQADLYISEGLBPRZ ITEGFEAAKEKALQ 

TTTCTGGAACAAGTCAAAGTAAGCAAAGAGATGGACAGAGAAACACTCATCGATGTGGCCAGGACATCTCTGCGGACTAAAGTTCATGCT 
FLEQVKVSKEMDRETLIOVARTSLRTKVHA 

GAACTTGCAGATGTCTTGACAGAGGCTGTAGTGGACTCCATCTTGGCCATTAGGAAAAAGGACGAGCCCATTGACCTCTTCATGGTTGAG 
ELADVLTEAVVDSILAIRKKDEPIDLFMVE 

ATCATGGAGATGAAGCATAAATCTGAGACAGATACAAGCTTAATCAGAGGGCTTGTTTTGGATCATGGAGCTCGGCATCCTGATATGAAG 
IMEMKHKSETDTSLIRGLVLDHGARHPDMK 

AAGAGAGTGGAAAATGCCTACATCCTCa.CGTGCAACGTGTCCTTAGAGTATGAGAAAACAGAAGTGAATTCTGGGTTTTTTTACAAGAGT 
KRVENAYILTCKVSLEYEKTEVNSGFFYKS 

GCAGAAGAGAGAGAAAAACTAGTAAAGGCTGAAAGAAAATTCATTGAAGATAGAGTTAAAAAAATCATAGAGCTGAAAAAGAAAGTCTGT 
AEEREKLVKAERKFIEDRVKKI lELKKKVC 

GGTGACTCAGATAAAGGATTTGTCGTTATTAATCAAAAGGGGATTGACCCCTTTTCCTTAGATGCCCTTGCGAAAGAAGGGATCGTAGCT 
GDSDKGFVVINQKGIDPFSLDALAKEGIVA 

CTGCGCAGAGCCAAGAGGAGAAACATGGAGAGGCTGACACTTGCTTGTGGTGGGATAGCTCTGAATTCCTTTGATGACCTGAATCCTGAC 
LRRAKRRNMERLTLACGGIALNSFDDLNPD 

TGTTTGGGACATGCAGGGCTTGTCTATGAGTATACACTGGGTGAGGAGAAGTTCACCTTTATTGAGAAGTGTAACAATCCCCGTTCTGTC 
CLGHAGLVYEYTLGEEKFTFIEKCNNPRSV 

ACTTTACTGGTTAAAGGACCAAATAAGCACACACTGACTCAAATCAAGGATGCAATAAGAGATGGCTTGAGGGCTGTCAAAAATGCTO 
TLLVKGPNKHTLTQIKDAIRDGLRAVKNAI 

GATGATGGCTGTGTTGTCCCAGGTGCGGGTGCAGTAGAAGTGGCACTGGCAGAAGCTCTGATTAAATACAAGCCCAGTGTGAAGGGCAGG 
DDGCVVPGAGAVEVALAEALIKYKPSVKGR 

GCGCAGCTTGGAGTCCAGGCATTTGCAGATGCCTTGCTCATCATTCCCAAGGTTCTTGCGCAAAACTCTGGTTTTGACCTTCAGGAAA 
AQLGVQAFADALLIIPKVLAQNSGFDLQET 

TTAGTTAAAGTTCAAGCTGAACATTCAGAATCGGGCCAGCTCGTAGGTGTGGATCTGAGCACAGGTGAGCCGATGGTGGCCGCAGAGATG 
LVKVQAEESESGQLVGVDLSTGEPMVAAEM 

GGTGTGTGGGATAACTACTGTGTGAAGAAGCAGCTGCTACACTCCTGTACTGTGATCGCCACCAACATTCTCCTGGTCGACGAGATCATG 
GVWDNYCVKKQLLHSCTVIATNILLVDEIM 

CGAGCTGGGATGTCCTCTCTGAAGGGTTGAGGCCTGCCTGTGATACTACAGGATGTTGGGGGGAATGGTTATTTTTGTCCAAGCTTCAAG 
RAGMSSLKG . 

TGATTTGGAAAAAAATTTTCTCTTCCTGATTGGAGAAAAGAAACGGGACATTTGACACCTATTCAAATTATACTGTAAAATTTTATTTTA 
TTTTTGCCTTGAGTATCTGAAGACACTCAAAGCAGCTCTTTTTCAACCCACTGAACAAGATGTTTTAGCTACACCGATACAAAAATTACA 
TAATAAGATAAGCATGTTGTCTACCCTTGTTCCATAAGTGTTCTTTGAAAGTTTGTAATGGTTTTCTCCTAAATAAGGCATGGTGACACA 
TGCCTGTAAGCCTAGCCCrrrGGAAATAGTCCGGAATTTCTATGCCAACTCAGGCTACAGGAGACCCCAGGTCGAAAGAATAATTTGTTG 
TGGATGTATTTGAAATTATCCAGCCAACTCCCTGTTAAACATGTAAGATCCTTGCCAGTGTAAAACACATCTGGGTAATTTATGGGTTGC 
ATAATGTCTAATAAATACTTAAAAGAGTGAAAAAAAAAAAAAAAA 
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Fig.Sg. 

(Ccth) 

pCBLSO cDNA sequence 

agatgatgcccacacgagttatcctgttgaaagagggtactgatagctcccagggcatccctcagctcct 
mmptpvillkegtdssqgipqlvsnisac 

caagtgattgctgaggctgtaagaacx:accctgggtccccgtggcatggacaaacotat^^ 

QVIAEAVRTTLGPRGMDKLIVDGRGKATIS 
AATGATGGGGCCACAATTCTGAAACTCCTCGATGTTGTCCATCCTGCAGCAAAGACTTTA^^ 

NDGATILKLLDVVHPAAKTLVDIAKSQDAE 
GTTGGTGATGGCACCACCTCAGTGACCCTGCTGGCTGCGGAGTTTCTGAAGCAGGTGAAGC^ 

VGDGTTSVTLLAAEFLKQVKPYVEEGLHPQ 
ATCATCATCCGAGCTTTCCGCACAGCCACCCAATTGGCTGTTAACAAAATCAAAGAGATAGCTGTC^ 

I I IRAFRTATQLAVNKIKEIAVTVKKQDKV 
GAGOIGAGGAAGATGCTGGAGAAGTGTGCGATGACAGCCCTGAGCTCCAAGCTGATCTCC^ 

EQRKMLEKCAMTALSSKLISQQKVFFAKMV 
GTTGATGCCGTGATCATGCTTGACGAGCTGCTGCAGCTTAAAATGATTGGCATC^ 

VDAVMMLDELLQLKMIGIKKVQGGALEESQ 
CTAGTTGCTGGTGTTGCGTTCAAGAACSACTTTCTCTTATGCTGGGTTTGAAAT^ 

LVAGVAFKKTFSYAGFEMQPKKY KNPKIAL 
TTAAATGTTGAGCPTGAGCTGAAAGCAGAGAAAGATAATGCTGAAATCAGAGTCCACA 

LNVELELKAEKDNAEIRVHTVEDYQAIVDA 
GAGTGGAATATTCTCTATGACAAGTTAGAGAAGATCCATCAGTCTGGAGCCAAAG^ 

EWNILYDKLEKIHQSGAKVILSKLPIGDVA 
ACCCAGTACTTTGCTGATAGGGACATGTTCTGTGCTGGCCGAGTGCCTG^ 

TQYFADRDMFCAGRVPEEDLKRTMMACGGS 
ATCCAGACCAGTGTGAATGCnCTGGTTCCAGATGTGerGGGCCACTTC 

IQTSVNALVPDVLGHCQVFEETQIGGERYN 
TTCTTCACTGGCTGCCCTAAGGCCAAGAOITGTACCATCATCCT^ 

FFTGCPKAKTCTIILRGGAEQFMEETERSL 
CATGATGCCATCATGATTGTGAGGAGGGCCATCAAGAATGACTCTGTGG 

HDAIMIVRRAIKNDSVVAGGGAIEMELSKY 
CTGCGGGATTACTCGAGGACCATTCCTGGGAAGCAGCAGCTGTTGATTGG^ 

LRDYSRTIPGKQQLLIGAYAKALEIIPRQL 
TCTGACAACGCTGGCTTTGATGCCACAAAC^ 

CDNAGFDATNILNKLRARHAQGGMWYGVDI 
AACAATGAGAAOITCGCCGACAACTTCCAGGCATTTGTGTGGGAGCCAGCCA^^ 

NNEN IADN FQAFVWEPAMVRIN ALTAASEA 
GCATGCCTTATTGTGTCCGTGGATGAGACTATCAAGAACCCCCGCTCCACTGT^ 

ACLIVSVDETIKNPRST VDPPAPSAGRGRG 

CAAGCCCGCTTCCACTGAGAGGCGAGGCGGTCTGCACCTCCTTGT 
Q A R F H . 

CTCACTGAGGTTATTTAAATAAAGC 
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Fig.Sh. 

(Ccfq) 

pTthetal cDNA sequence 



CCCATTCCCAGCAGGCAGTGCGCGGCGCCCCTTGCTTCCTCGGCGTCTCGGAGCGAGCGCGTGAGCGGTC^ 

M 

GCGCTTCACGTCCCCAAGGCCCCGGGCTTTGCCCAGATGCTCAAGGATGGAGCCAAACATTT^ 

ALHVPKAPGFAQMLKDGAKHFSGLEEAVYR 
AATATACAGGCCTGCAAGGAGCTTGCTCAGACTACTCGCACAGCGTACGGACCAAATGGAAT 

NIQACKELAQTTRTAYGPNGMNKMVINRLE 
AAGTTGTTTGTGACAAACGATGCAGCGACTATTTTAAGAGAGCTAGAAGTGCAGCATCCTGCTGCA^ 

KLFVTNDAATILRELEVQHPAAKMIVMASfi 
ATGCAAGAACAAGAGGTTGGTGATGGCACAAACTTCGTTCTGGTGTTCGCTGGGGCTCTTCTGGAACTAGCT 

MQEQEVGDGTNFVLVFAGALLELAEELLRI 

GGCCTGTCAGTATCAGAGGTAATATCGGGGTATGAAATAGCTTGCAAAAAAGCTCATGAGATCCTTCCTGAGTT^ 
GLSVSEVISGYEIACKKAHEILPELVC.CSA 

AAAAACCTTCGAGATGTTGATGAAGTGTCCTCTCTGCTTCGAACCTCCATCATGAGTAAGCAGTACGGCAGCGAG 
KNLRDVDEVSSLLRTSIMSKQYGSETFL.AK 

CTTATTGCTCAGGCTTGTGTTTCTATCTTTCCTGATTCTGGCAATTTCAATGTTGATAACAT^ 

LIAQACVSIFPDSGNFNVDNXRVCKILG>SG 
ATTTATTCATCCrCJ^GTATTACATGGCATGGTTTTTAAGAAGGAAACTGAAGGTGATGT 

lYSSSVLHGMVFKKE'TEGDVTSVKDAKIAV 
TACTCTTGTCCGTTTGATGGCATGATAACAGAGACAAAGGGGACCGTGCTGATTAAGACTGCC^ 

YSCPFDGMITETKGTVLIKTAEELM NFS KG 
GAGGAGAACCTCATGGATGCTCAGGTGAAGGCCATTGCAGGCACTGGTGCAAATGTCATAGTAACAGG^ 

EENLMDAQVKAIAGTGANVIVTGGKVAD IA 
CTTCATTATGCTAACAAGTACAATATCATGTTGGTGAGACTGAACTCAAAGTG 

LHYANKYNIMLVRLNSKWDLRRLCKTVGAT 
GCTCTTCCAAAATTGACTCCTCCCGTCCAAGAAGAAATGGGACATTGTGACAGTGTT^ 

ALPKLTPPVQEEMGHCDSVYLS EVGDTQVV 
GTTTTTAAGCATGAAAAAGAAGATGGTGCCATTTCTACTATAGTTCTTCGAGGT^ 

VFK H EKEDGAI ST IVLRGSTDNLMDD IERA 
GTAGATGATGGAGTTAATACTTTCAAAGTTCTCACAAGGGATAAGCGTCTTGTACCTG^ 

VDDGVNTFKVLTRDKRLVPGGGATEIELAK 
CAAATCACATCATATGGAGAGACGTGTCCTGGGCTTGAACAGTATGCTATTAAGAAGTTTGCTGAAG^ 

QITSYGETCPGLEQYAIKKFAEA FEAIPRA 
CTGGCAGAAAATTCTGGCGTGAAGGCCAATGAAGTTATCTCTAAACTTTATTCCGTACACCi^ 

LAEN SGVKANEVISKLYSVHQEGNKNVGLD 
ATCGAGGCTGAAGTCCCTGCTGTAAAGGATATGTTAGAAGCCAGTATTTTAGATACTTACTTGGGAAAATACT 

I EAEVPAVKDMLEAS ILDTYLGKYWAIKLA 
ACTAATGCTGCAGTCACTGTACTAAGAGTGGATCAGATCATCATGGCAAAACCAGCTGGTGG 

TNAAVTVLRVDQI IMAKPAGGPKPPSGKKD 

TGGGATGACGACCAGAATGACTGAGGAACTTGTCATAGGTTAAGAGTTGTGTTTGTAGAGTAGAACTTGCCCAG^ 
W D D D Q N D . 
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Fig. 11b, 
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